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Complex systems are very often organized under the form of networks where nodes and edges 
are embedded in space. Transportation and mobihty networks, Internet, mobile phone networks, 
power grids, social and contact networks, neural networks, are all examples where space is relevant 
and where topology alone does not contain all the information. Characterizing and understanding 
the structure and the evolution of spatial networks is thus crucial for many different fields ranging 
from urbanism to epidemiology. An important consequence of space on networks is that there 
is a cost associated to the length of edges which in turn has dramatic effects on the topological 
structure of these networks. We will expose thoroughly the current state of our understanding 
of how the spatial constraints affect the structure and properties of these networks. We will 
review the most recent empirical observations and the most important models of spatial networks. 
We will also discuss various processes which take place on these spatial networks, such as phase 
transitions, random walks, synchronization, navigation, resilience, and disease spread. 
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I. NETWORKS AND SPACE 

A. Introduction 

Networks (or graphs) were for a long time the subject 
of many studies in mathematics, mathematical sociology, 
computer science and in quantitative geography. In the 
case of random networks, the first and most important 
model was proposed by Erdos and Renyi [122, 123] at 
the end of the 1950s and was at the basis of most studies 
until recently. The interest in networks was however re- 
newed in 1998 by Watts and Strogatz [313] who extracted 
stylized facts from real-world networks and proposed a 
simple, new model of random networks. This renewal in- 
terest was reinforced after the publication, a year later, of 
an article by Albert and Barabasi [27] on the existence of 
strong degree heterogeneities. Strong heterogeneities are 
in sharp contrast with the random graphs considered so 
far and the existence of strong fluctuations in real-world 
networks triggered a wealth of studies. A decade later, 
we can now find many books [32, 70, 87, 114, 245, 255] 
and reviews on this subject [7, 54, 113, 244]. These books 
and reviews discuss usually very quickly spatial aspects 
of networks. However, for many critical infrastructures, 
communication or biological networks, space is relevant: 
most of the people have their friends and relatives in their 
neighborhood, power grids and transportation networks 
depend obviously on distance, many communication net- 
work devices have short radio range, the length of axons 
in a brain has a cost, and the spread of contagious dis- 
eases is not uniform across territories. In particular, in 
the important case of the brain, regions that are spa- 
tially closer have a larger probability of being connected 
than remote regions as longer axons are more costly in 
terms of material and energy [68]. Wiring costs depend- 
ing on distance is thus certainly an important aspect of 
brain networks and we can probably expect spatial net- 
works to be very relevant in this rapidly evolving topic. 
Another particularly important example of such a spa- 
tial network is the Internet which is defined as the set 
of routers linked by physical cables with different lengths 
and latency times. More generally, the distance could 
be another parameter such as a social distance measured 
by salary, socio-professional category differences, or any 
quantity which measures the cost associated with the for- 
mation of a link. 

All these examples show that these networks have 
nodes and edges which are constrained by some geometry 
and are usually embedded in a two- or three-dimensional 
space and this has important effects on their topologi- 
cal properties and consequently on processes which take 
place on them. If there is a cost associated to the edge 
length, longer links must be compensated by some ad- 
vantage such as for example being connected to a well- 
connected node — that is, a hub. The topological aspects 
of the network are then correlated to spatial aspects such 
as the location of the nodes and the length of edges. 



B. Quantitative geography and networks 

Spatial networks were actually the subject a long time 
ago of many studies in quantitative geography. Objects 
of studies in geography are locations, activities, flows 
of individuals and goods, and already in the 1970s the 
scientists working in quantitative geography focused on 
networks evolving in time and space. One can consult 
for example books such as the remarkably modern 'Net- 
works analysis in Geography' by Haggett and Chorley 
(pubUshed in 1969 [150]) or 'Models in Geography' [81] 
to realize that many modern questions in the complex 
system field are actually at least 40 years old. In these 
books, the authors discuss the importance of space in 
the formation and evolution of networks. They develop 
tools to characterize spatial networks and discuss possi- 
ble models. Maybe what was lacking at that time, were 
datasets of large networks and larger computer capabil- 
ities, but a lot of interesting thoughts can be found in 
these early studies. Most of the important problems such 
as the location of nodes of a network, the evolution of 
transportation networks and their interaction with pop- 
ulation and activity density are addressed in these earlier 
studies, but many important points still remain unclear 
and will certainly benefit from the current knowledge on 
networks and complex systems. Advances in complex 
networks already helped researchers to gain new insights 
on these difhcult problems (see for example the recent 
Handbook on Theoretical and Quantitative Geography 
[44] ) and the present review is an attempt to collect mod- 
ern results on networks and to help researchers in various 
fields to reach quantitative answers and realistic model- 
ing. In geography and urban studies, it would be about 
understanding the evolution of transportation networks, 
the human mobility, the spatial structure of urban areas, 
etc. and how these different factors are entangled with 
each other, in order to propose an integrated approach 
of scale, mobility, and spatial distribution of activities at 
various scales. 



C. What this review is (not) about 

The importance of spatial networks in current prob- 
lems, together with the lack of a review specific on this 
topic brought us to propose the present review article. 
We will first review the tools to characterize these net- 
works and the empirical properties of some important 
spatial networks. We then review the most important 
models of spatial network formation which allows to un- 
derstand the main effects of the spatial constraints on the 
network properties. We will also discuss how space affect 
various processes taking place on these networks such as 
walking and searching, resilience, or disease spread. 

As mentioned above, spatial networks appear in many 
different fields and we will try to cover in some detail 
the studies in these various areas. However, it is obvi- 
ously impossible to review all the existing results related 
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to spatial networks. This implies to make some choices 
and we basically restricted ourselves to (i) recent topics, 
(ii) with a sufficient body of literature, (iii) of funda- 
mental research, and (iv) with preferably applications to 
real- world systems. The goal is to summarize what we 
understand at this point about the effect of space in net- 
works. In particular, we decided not to discuss here the 
following subjects (which is not an exhaustive list). 

River networks. Rivers form spatial networks (which 
in most cases are essentially trees), and result from the 
interplay between gravity and the elevation distribution. 
These networks were the subject of many studies and 
we refer the interested reader to the books [70, 269] and 
references therein for a recent account on this subject. 

Sensor, ad hoc, and wireless networks. There is of 
course a huge literature on these subjects and which is 
mostly found in computer and engineering science. In 
this review, we will discuss some theoretical aspects of 
these problems and for more applied problems we re- 
fer the interested reader to more specialized reviews and 
books such as [6]. 

Spatial games. Game theory was recently applied to 
situations where agents are located on the nodes of a 
networks. We refer the interested reader in this particular 
field to the articles [153, 251] for example. 

Mathematical studies of planar maps. Planar graphs 
and maps [301] are combinatorial objects which are the 
object of many studies in mathematics and also in physics 
where they appear as a natural discretization of random 
surfaces used in two-dimensional quantum gravity. In 
particular these mathematical methods allow to under- 
stand the critical behavior of the Ising model on a random 
planar lattice [59]. We refer the interested reader to the 
classical book of Ambjorn, Durhuus and Jonsson [15]. 

The detailed outline of the review is the following. In 
the chapter II, we introduce the main tools to character- 
ize spatial networks. Many spatial networks are planar 
and we recall the main results on planar networks and 
how to characterize them in section II. A. In addition, 
since spatial networks mix space and topology, we need 
specific tools to characterize them that we describe in 
section II. B. 

In section III, we review the properties of real- 
world spatial networks such as transportation networks 
(III. A), infrastructure networks (III.B), mobility net- 
works (III.C), and neural networks (III.D). In these sec- 
tions, we insist on salient stylized facts which allow us 
to draw some general features of real-world networks in 
section III.E. 

In section IV, we review the most important mod- 
els of spatial networks. We divide these models in 
five large categories: we start with random geometric 
graphs (IV. A), followed by spatial generalizations of the 
Erdos-Renyi graph (IV. B) and the Watts-Strogatz model 
(IV. C). We then review models of growing spatial net- 
works in section IV. D and we end this chapter with op- 
timal networks IV.E. 

Finally, we discuss various processes which take place 



on spatial networks in section V. In particular, we will re- 
view the effects of space on transitions in the Ising model 
(V.A), random walks (V.B), navigation and searching 
(V.D), robustness (V.E), and disease spread (V.F). 

In the last section (VI), we propose a summary of the 
review about the effect of space on networks and we end 
the discussion with a list of open problems that we believe 
are interesting directions for future research. 

II. CHARACTERIZING SPATIAL NETWORKS 
A. Generalities on planar networks 

1. Spatial and planar networks 

Loosely speaking, spatial networks are networks for 
which the nodes are located in a space equipped with a 
metric. For most practical applications, the space is the 
two-dimensional space and the metric is the usual eu- 
clidean distance. This definition implies in general that 
the probability of finding a link between two nodes will 
decrease with the distance. However, it does not imply 
that a spatial network is planar. Indeed, the airline pas- 
senger networks is a network connecting through direct 
flights the airports in the world, but is not a planar net- 
work. 

With this definition of a spatial network links are not 
necessarily embedded in space: social networks for ex- 
ample connect individuals through friendship relations. 
In this case, space intervenes in the fact that the con- 
nection probability between two individuals usually de- 
creases with the distance between them. 

For many infrastructure networks however, planarity is 
unavoidable. Roads, rail, and other transportation net- 
works are spatial and to a good accuracy planar net- 
works. For many applications, planar spatial networks 
are the most important and most studies have focused on 
these examples. In this section, we will thus recall some 
basic results about planar networks and we then recall 
the standard tools for characterizing networks. In par- 
ticular, we will focus on the not-so-standard tools which 
can help in characterizing spatial networks, mixing topo- 
logical and metric features. 

2. Classical results for planar networks 

Basic results on planar networks can be found in any 
graph theory textbook (see for example [85]) and we will 
very briefly recall the most important ones. 

Basically, a planar graph is a graph that can be drawn 
in the plane in such a way that its edges do not intersect. 
Not all drawings of planar graphs are without intersection 
and a drawing without intersection is sometimes called 
a plane graph or a planar embedding of the graph. In 
real-world cases, these considerations actually do not ap- 
ply since the nodes and the edges represent in general 
physical objects. 
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In general it is not a trivial thing to check if a graph 
is planar and the Kuratowski theorem (see for example 
the textbook [85]) states that a finite graph is planar if 
and only if it does not contain a subgraph that is homeo- 
morphic to the graphs Kc, or 3 shown in Fig. 1. Many 




K5 K33 

FIG. 1 Complete graphs and 7^3,3. 

algorithms were developed to test the planarity of a given 
network (see for example [180]) and most of these meth- 
ods operate in 0{N) time. 

There are some very general facts that can be demon- 
strated for planar graphs, and among them Euler's for- 
mula is probably the best known. Euler showed that a 
finite connected planar graph satisfies the following for- 
mula 

N-E + F = 2 (1) 

where A'' is the number of nodes, E the number of edges, 
and F is the number of faces. This formula can be easily 
proved by induction by noting that removing an edge 
decreases F and E by one, leaving N — E + F invariant. 
We can repeat this operation until we get a tree for which 
F = 1 and iV = + 1 leading toA^-£' + i^ = £' + l- 
E + l = 2. 

Moreover for any finite connected planar graph we can 
obtain a bound. Indeed, any face is bounded by at least 
three edges and every edge separates two faces at most 
which implies that E > 3F/2. From Euler's formula we 
then obtain 

E <3N-6 (2) 

In other words, planar graphs are always sparse with a 
bounded average degree (k) < 6. 

We note here that if we start from a planar point 
set, we can construct various planar graph by connect- 
ing these points (see the next section on tessellations for 
example) and by definition the maximal planar graph is 
the triangulation of a planar point set such that the ad- 
dition of any edge results in a non-planar graph. Such a 
network is useful for example to assess the efficiency of 
a real-world planar network and provides an interesting 
null model. For such a maximal planar graph the num- 
ber of edges and faces are maximal and are equal to the 



bounds E = 3N — 6 and F = 2N — 5, respectively. 

Planar sets thus form faces or cells which have a certain 
shape. In certain conditions, it can be interesting to char- 
acterize statistically these shapes and various indicators 
were developed in this perspective (see [150] for a list of 
these indicators). In particular, if A is the area of a cell, 
and L the major axis, the form ratio is defined as A/L'^ 
or equivalently the elongation ratio given by VA/L. In 
the paper [206] on the road network structure, Lammer 
et al. use another definition of the form factor of a cell 
and define it as 



where ttD^ is the area of the circumscribed circle. 



3. Voronoi tessellation 

A tessellation or tiling of the plane is a collection of 
plane figures that fills the plane with no overlaps and 
no gaps. One may also speak of tessellations of parts of 
the plane or of other surfaces (generalizations to higher 
dimensions are also possible). A Voronoi diagram is de- 
termined by distances to a specified discrete set of points 
(see for example the figure 2) . Each site has a Voronoi cell 
V{s) consisting of all points closer to s than to any other 
site. The segments of the Voronoi diagram are all the 
points in the plane that are equidistant to the two near- 
est sites. The Voronoi nodes are the points equidistant 
to three (or more) sites and unless some degeneracies, all 
nodes of the Voronoi tessellation have a degree equal to 
three. The dual graph for a Voronoi diagram corresponds 
to the Delaunay triangulation for the same set of points. 

Voronoi tessellations are interesting for spatial net- 
works in the sense that they provide a natural null model 
to which one can compare a real world network. 




FIG. 2 Example of the Voronoi tessellation on a set of points. 
From [160]. 

In particular, an important case is obtained when the 
points are distributed uniformly and independently in 
the plane. In this case, one can speak of a Poisson- 
Voronoi tessellation for which some statistical properties 
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are known (see [158-160] for a short review on this sub- 
ject). In particular, the probabihty that a point has a 
n-sided Voronoi cell is given for large n by [158, 159] 



C (87r2)" 
4tt^ (2n)! 



l + O 



1 



(4) 



-2n 



which at the dominant order behaves as p„ ? 

B. Mixing space and topology 

1. Basic measures 

a. Adjacency matrix, dual graph. A graph with N nodes 
and E edges can be described by its iV x TV adjacency 
matrix A which is defined as 



A,. 



1 if i and j are connected 
otherwise 



(5) 



If the graph is undirected then the matrix A is symmetric. 
The degree of a node is the number of its neighbors and 
is then given by 



ki — ^ Ai 



(6) 



The first simple indicator of a graph is the average degree 



(fc) = l/AT^fc, = 2E/N 



(7) 



where here and in the following the brackets (•) denote 
the average over the nodes of the network. In particular 
the scaling of (fc) with N indicates if the network is sparse 
(which is the case when (fc) — >■ const, for n — ^ oo). 

The distribution of degree P{k) is usually a quantity of 
interest and can display large heterogeneities such as it is 
observed in scale- free networks (see for example [7]). We 
indeed observe that for spatial networks such as airline 
networks or the Internet the degrees are very heteroge- 
neous (see III). However, when physical constraints are 
strong or when the cost associated with the creation of 
new links is large, a cut-off appears in the degree distri- 
bution [14] and in some case the distribution can be very 
peaked. This is the case for the road network for exam- 
ple and more generally in the case of planar networks for 
which the degree distribution P{k) is of little interest. 

In some real-world cases such as the road network for 
example, it is natural to study the usual (or 'primal') 
representation where the nodes are the intersections and 
the links represent the road segment between the inter- 
section. However another representation, the dual graph 
can be of interest (see [260] . In this road network exam- 
ple, the dual graph is constructed in the following way: 
the nodes are the roads and two nodes are connected if 
there exist an intersection between the two corresponding 
roads. One can then measure the degree of a node which 



represents the number of roads which intersect a given 
road. Also, the shortest path length in this network rep- 
resents the number of different roads one has to take to 
go from one point to another. Finally, we note that even 
if the road network has a peaked degree distribution its 
dual representation can display broad distributions [184]. 



b. Clustering, assortativity, and average shortest path. 
Complex networks are essentially characterized by a 
small set of parameters which are not all relevant for 
spatial networks. For example, the degree distribution 
which has been the main subject of interest in complex 
network studies is usually peaked for planar networks, 
due to the spatial constraints. However, the clustering 
coefficient as we will see is important in the characteri- 
zation of spatial networks. For a node i of degree ki it is 
defined as 



E, 



(8) 



where Ei is the number of edges among the neighbors 
of i. This quantity gives some information about local 
clustering and is the object of many studies in complex 
networks. It is also a quantity of interest for spatial net- 
works. Indeed for Erdos-Renyi (ER) random graphs (see 
section IV. B) with finite average degree, the average clus- 
tering coefficient is simply given by 



{C) 



1 

N 



(9) 



where the brackets (•) denote the average over the net- 
work. In contrast, for spatial networks, loser nodes have a 
larger probability to be connected, leading to a large clus- 
tering coefficient. The variation of this clustering coeffi- 
cient in space can thus bring valuable information about 
the spatial structure of the network under consideration. 

The clustering coefficient depends on the number of tri- 
angles or cycles of length 3 and can also be computed by 
using the adjacency matrix A. Powers of the adjacency 
matrix give the number of paths of variable length. For 
instance the quantity gTr(A^) is the number C3 of cycles 
of length tree and is related to the clustering coefficient. 
Analogously we can define and count cycles of various 
lengths (see for example [50, 272] and references therein) 
and compare this number to the ones obtained on null 
models (lattices, triangulations, etc). 

Finally, many studies define the clustering coefficient 
per degree classes which is given by 



C{k) 



1 



N{k) 



(10) 



i/ki—k 



The behavior of C{k) versus fc thus gives an indication 
how the clustering is organized when we explore different 
classes of degrees. However, in order to be useful, this 
quantity needs to be applied to networks with a large 
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range of degree variations which is usually not the case 
in spatial networks. 

In general the degrees at the two end nodes of a link are 
correlated and to describe these degree correlations one 
needs the two-point correlation function P{k'\k). This 
quantity represents the probability that any edge starting 
at a vertex of degree k ends at a vertex of degree k' . 
Higher order correlation functions can be defined and 
we refer the interested reader to [32] for example. The 
function P{k'\k) is however not easy to handle and one 
can define the assortativity [256, 303] 

fc„n(A:) (11) 

k' 

A similar quantity can be defined for each node as the 
average degree of the neighbor 

There are essentially two classes of behaviors for the as- 
sortativity. If fc„„(fc) is an increasing function of k, ver- 
tices with large degrees have a larger probability to con- 
nect to similar nodes with a large degree. In this case, 
we speak of an assortative network and in the opposite 
case of a disassortative network. It is expected in gen- 
eral that social networks are mostly assortative, while 
technological networks are disassortative. However for 
spatial networks we will see (see section HI) that spatial 
constraints usually implies a flat function fc„„(fc). 

Usually, there are many paths between two nodes in 
a connected networks and if we keep the shortest one it 
defines a distance on the network 

^ihj) = |pat/i| (13) 

paths{i^j) 

where the length |paift,| of the path is defined as its num- 
ber of edges. The diameter of the graph can be defined as 
the maximum value of all or can also be estimated 

by the average this distance over all pairs of nodes in or- 
der to characterize the 'size' of the network. Indeed for a 
d-dimensional regular lattice with N nodes, this average 
shortest path {£) scales as 

{£} - N^^"^ (14) 

In a small-world network (see [313] and section IV. C) 
constructed over a d— dimensional lattice {£) has a very 
different behavior 

{e)r^logN (15) 

The crossover from a large-world behavior N^^'^ to a 
small- world one with logA^ can be achieved for a den- 
sity p of long finks (or 'shortcuts') [36] such that 

pN - 1 (16) 



The effect of space could thus in principle be detected 
in the behavior of {£){N). It should however be noted 
that if the number of nodes is too small this can be a 
tricky task. In the case of brain networks for example, a 
behavior typical of a three-dimensional network in N^^^ 
could easily be confused with a logarithmic behavior if 
N is not large enough. 



c. Spectral graph theory. Spectral graph theory is an im- 
portant branch of mathematics and allows to get insights 
about the structure of a graph with quantities computed 
from the eigenvalues of the graph Laplacian [83] . For ex- 
ample, the so-called algebraic connectivity is the smallest 
non-zero eigenvalue, the distribution of cycle lengths can 
be computed using a moment expansion of the eigenval- 
ues, the stationary state of a random walk and synchro- 
nization properties are governed by the largest eigenvalue 
of the adjacency matrix. In a nutshell, spectral graph 
theory thus studies the adjacency matrices of graphs and 
connect their eigenvalues to other properties. There are 
different conventions but essentially one is interested in 
the discrete Laplacian on the network defined by 

L = D-A (17) 

where Dij = kiSij is the identity matrix times the degree 
ki of node i and A is the adjacency matrix of the graph. 
If the graph is undirected the Laplacian is symmetric and 
if the graph is an infinite square lattice grid of spacing a, 
this definition of the Laplacian coincides with the contin- 
uous Laplacian when a — 0. We list here some basic 
properties of the Laplacian and we refer the interested 
reader to more advanced material such as the book [83]. 

• In the undirected case, L is symmetric and has real 
positive eigenvalues Aq = < Ai < • • • < Xn- 
The multiplicity of Aq = (whose eigenvector has 
all components equal) is equal to the number of 
connected components of the graph. 

• The eigenvalue Ai is called the algebraic connectiv- 
ity [128] and plays an important role in the charac- 
terization of the graph and, roughly speaking, the 
larger Ai, the more connected the graph. 

• The distribution of eigenvalues is important in 
many applications. For instance for random walks, 
it is related to the return probability and it can 
also be used (together with the eigenvectors) for 
visual representations of graphs known as spectral 
embeddings (see for example [219]). Another exam- 
ple concerns the synchronization of linearly coupled 
identical oscillators where a master stability func- 
tion relates the stability of synchronized solutions 
to the eigenvalues of the Laplacian (more precisely 
to the ratio A^v/Ai, see [257] and [23] for the small- 
world case). 
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d. Betweenness centrality. The importance of a node is 
characterized by its so-called centrality. There are how- 
ever many different centrality indicators such as the de- 
gree, the closeness, etc., but we will focus here on the 
betweenness centrality g(i) which is defined as [34, 35, 
131, 138, 243]. 

5W = E^ (18) 

where ast is the number of shortest paths going from s to 
t and (Jst{i) is the number of shortest paths going from s 
to t through the node i. This quantity g{i) thus charac- 
terizes the importance of the node i in the organization 
of flows in the network. Note that with this definition, 
the betweenness centrality of terminal nodes is zero. The 
betweenness centrality can similarly be defined for edges 

.(e) = E — (19) 

where crgf (e) is the number of shortest paths going from 
s to t and going through the edge e. 

The betweenness centrality of a vertex is determined by 
its ability to provide a path between separated regions of 
the network. Hubs are natural crossroads for paths and 
it is natural to observe a marked correlation between the 
average g{k) — X]i/fci=fc 5(*)/^(^) ^"^^ ^ ^ expressed in 
the following relation [35] 

g{k) ^ (20) 

where rj depends on the characteristics of the network. 
We expect this relation to be altered when spatial con- 
straints become important and in order to understand 
this effect we consider a one-dimensional lattice which 
is the simplest case of a spatially ordered network. For 
this lattice the shortest path between two nodes is sim- 
ply the euclidean geodesic and for two points lying far 
from each other, the probability that the shortest path 
passes near the barycenter of the network is very large. In 
other words, the barycenter (and its neighbors) will have 
a large centrality as illustrated in Fig. 3a. In contrast. 
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raphy, this consideration does not apply anymore and if 
we rewire more and more links (as illustrated in Fig. 3b) 
we observe a progressive decorrelation of centrality and 
space while the correlation with degree increases. In a 
lattice, the betweenness centrality depends essentially on 
space and is maximum at the barycenter, while in a net- 
work the betweenness centrality of a node depends on its 
degree. When the network is constituted of long links 
superimposed on a lattice, we expect the appearance of 
'anomalies' characterized by large deviations around the 
behavior g ^ k^. In order to characterize quantitatively 
these anomalies [31], one can compute the fluctuations of 
the betweenness centrality AMR{k) for a randomized net- 
work with the same degree distribution than the original 
network and constructed with the MoUoy-Reed algorithm 
[234]. We can then consider a node i to be anomalous 
if its betweenness centrality g{i) lies outside the interval 
[{g{k)) - iyAMR{k), {g{k)) + i/AMi?(fc)] where the value 
of v determines the confidence interval. For v « 1.952, 
the considered interval represents 95% of the nodes in 
the case of Gaussian distributed centralities around the 
average. 

The effect of shortcuts and the appearance of anoma- 
lies can already be observed in the simple case of a one- 
dimensional lattice i — 1, . . . ,N. The betweenness cen- 
trality in this case is given by 5o(*) — {i ~ 1)(-^ ~ *) with 
a maximum at i = N/2 (see Fig. 4). When one link is 
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FIG. 4 Example of how the addition of a link perturbs the 
centrality. In black, the betweenness centrality for the Id 
lattice (of size (A'^ = 100) has a maximum at the barycenter 
N/2 — 50. The addition of a link between a = 20 and b = 
30 decreases the betweenness centrality between a and b and 
increases the betweenness centrality of nodes a and b. 




added between nodes a and 6, the betweenness centrality 
of the nodes in the interval ]a, b[ decreases. For example, 
if a < N/2 < & it is not difficult to see that the variation 
of the centrality of the node N/2 is bounded by 



FIG. 3 (a) Betweenness centrality for the (one-dimensional) 6g{N/2) < —a{N — b) (21) 

lattice case. The central nodes are close to the barycenter. 

(b) For a general graph, the central nodes are usually the which basically means that the shortest paths from a 
ones with large degree. node z e [1, a] to a node j £ [b, N] follow the shortcut and 

avoid the node N/2. In contrast, the betweenness cen- 
in a purely topological network with no underlying geog- trality of the contact points increases and the between- 
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ness centrality of node a is now g{a) = gQ{a)+5g{a) where 
5g{a) is positive and essentially counts the new pairs of 
nodes which are connected by shortest going through the 
new link (a, b). 



2. Mixing topology and space 

a. Strengths. In many cases, the adjacency matrix is not 
enough to fully characterize a network. This is particu- 
larly clear for airline networks for example for which the 
links are also characterized by the average number of 
passengers Wij flying on the connection (ij) (for other 
networks, such as the Internet it can be the traffic on 
a cable, etc.). For these weighted networks, the distri- 
bution of weights is a first indication of the existence of 
possible strong heterogeneities [29]. Another important 
information is the existence of correlation between the 
weights and the degrees. Such a correlation can be easily 
displayed by the study of the weight strength of a node i 
as defined by [29, 328] 



(22) 



where the sum runs over the set r(i) of neighbors of i. 
The strength generalizes the degree to weighted networks 
and in the case of the air transportation network quan- 
tifies the traffic of passengers handled by any given air- 
port. This quantity obviously depends on the degree k 
and increases (linearly) with k in the case of random un- 
correlated weights of average {w) as 



(23) 



A relation between the strength s^{k) averaged over the 
nodes of degree k of the form 



Ak^- 



(24) 



with an exponent (5^ > 1 (or with = 1 but with 
A (w)) is then the signature of non-trivial statistical 
correlations between weights and topology. In particu- 
lar, a value > 1 signals that the typical number of 
passengers per connection is not constant and increases 
with k. 

The notion of strength can obviously be extended to 
many different types of weights. In particular, for spatial 
networks, one can define [31] the distance strength of node 
i by 



(25) 



where dEii,j) is the euclidean distance between nodes i 
and j. This quantity s'^ represents the cumulated dis- 
tances of all the connections from (or to) the considered 
airport. Similarly to the usual weight strength, uncorre- 
lated random connections would lead to a linear behavior 



of the form s'^(fc) cx k while otherwise the presence of cor- 
relations would be signaled by a behavior of the form 



\k) - k^" 



(26) 



with I3d > 1. In such a case, there are correlations be- 
tween the topology and geography which implies that 
the typical length of the connection is not constant as it 
would be in the case for — 1 but increases with the 
number of connections. 



b. Indices a, 7 and variants. Ringness. Different indices 
were defined a long time ago mainly by scientists work- 
ing in quantitative geography since the 1960s and can be 
found in [150, 268, 296] (see also the more recent paper 
by Xie and Levinson [322]). Most of these indices are 
relatively simple but give valuable information about the 
structure of the network, in particular if we are interested 
in planar networks. These indices were used to charac- 
terize the topology of transportation networks and for 
example Garrison [133] measured some properties of the 
Interstate highway system and Kansky [186] proposed 
up to 14 indices to characterize these networks. We will 
here recall the most important indices which are called 
the 'alpha' and the 'gamma' indices. The simplest index 
is called the gamma index and is simply defined by 



E 
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E„ 



(27) 



where E is the number of edges and E„iax is the maximal 
number of edges (for a given number of nodes N). For 
non-planar networks, E^ax is given by N{N — l)/2 for 
non-directed graphs and for planar graphs we saw in the 
section II. A. 2 that E^ax = — 6 leading to 



E 



IP 



3A^- 6 



(28) 



The gamma index is a simple measure of the density 
of the network but one can define a similar quantity by 
counting the number of elementary cycles instead of the 
edges. The number of elementary cycle for a network is 
known as the cyclomatic number (see for example [85]) 
and is equal to 



V = E-N + 1 



(29) 



For a planar graph this number is always less or equal 
to 27V — 5 which leads naturally to the definition of the 
alpha index (also coined 'meshedness' in [66]) 



a 



E-N + 1 
27V - 5 



(30) 



This index lies in the interval [0, 1] and is equal to for a 
tree and equal to 1 for a maximal planar graph. Using the 
definition of the average degree (fc) = 2E/N the quantity 
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a reads in the large N limit as 



(31) 



which shows that in fact for a large network this index a 
does not contain much more information than the average 
degree. 

We note that more recently, other interesting indices 
were proposed in order to characterize specifically road 
networks [100, 322]. For example, in [100], Courtat, 
Gloaguen, Douady noticed that in some cities the de- 
gree distribution is very peaked around 3 — 4 and they 
then define the ratio 



N{l) + N{3) 



(32) 



where N{k) is the number of nodes of degree k. If this 
ratio is small the number of dead ends and of 'unfinished' 
crossing {k — 3) is small compared to the number of 
regular crossings with fc = 4. In the opposite case of 
large r^r, there is a dominance of /c = 4 nodes which 
signals a more organized city. 

The authors of [100] also define the 'compactness' of 
a city which measures how much a city is 'filled' with 
roads. If we denote by A the area of a city and by £t the 
total length of roads, the compactness 5* G [0, 1] can be 
defined in terms of the hull and city areas 



* = 1 



4A 



{£t - 2VAy 



(33) 



In the extreme case of one square city of linear size L — 

\J A with only one road encircling it, the total length 

is It = 4:\/A and the compactness is then = 0. At 

the other extreme, if the city roads constitute a square 

grid of spacing a, the total length is £t — 2L^/a and in 

the limit of a/L — >■ one has a very large compactness 
l-aVL2 

We end this section by mentioning the ringness. Arte- 
rial roads (including freeways, major highways) provide 
a high level of mobility and serve as the backbone of the 
road system [322]. Different measures (with many refer- 
ences) are discussed and defined in [322] and in particular 
the ringness is defined as 



X7 



(34) 



where iring is the total length of arterials on rings and 
where the denominator £tot is the total length of all ar- 
terials. This quantity ranging from to 1 is thus an in- 
dication of the importance of a ring and to what extent 
arterials are organized as trees. 



c. Route factor, detour index. When the network is em- 
bedded in a two-dimensional space, we can define at least 



two distances between the pairs of nodes. There is of 
course the natural euclidean distance ci£;(z,j) which can 
also be seen as the 'as crow flies' distance. There is also 
the total 'route' distance df((i,j) from i to j by com- 
puting the sum of length of segments which belong to 
the shortest path between i and j. The route factor (also 
called the detour index or the circuity or directness [212]) 
for this pair of nodes («, j) is then given by (see Fig. 5 for 
an example) 



Qihj) = 



dE{i,j) 



(35) 



This ratio is always larger than one and the closer to one, 
the more efficient the network. From this quantity, we 




FIG. 5 Example of a detour index calculation. The 'as crow 
flies' distance between the nodes A and B is dsiA, B) = vTO 
while the route distance over the network is dii{A,B) = 4 
leading to a detour index equal to Q{A, B) = 4/\/l0 ~ 1.265. 

can derive another one for a single node defined by 

j 

which measures the 'accessibility' for this specific node 
i. Indeed the smaller it is and the easier it is to reach 
the node i. Accessibility is a subject in itself (see for 
example [43] ) and there are many other measures for this 
concept and we refer the interested reader to the article 
[109, 121, 152]. This quantity (Q(j)) is related to the 
quantity called 'straightness centrality' [103] 



N 



1 sr^disiij) 



(37) 



And if one is interested in assessing the global efficiency 
of the network, one can compute the average over all 
pairs of nodes (also used in [306]) 



(Q) = 



1 



N{N - 1) 



(38) 



The average (Q) or the maximum Qmax, and more gen- 
erally the statistics oi Q{i,j) is important and contains a 
lot of information about the spatial network under con- 
sideration (see [12] for a discussion on this quantity for 
various networks). For example, one can define the in- 
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teresting quantity [12] 



m 



1 



(39) 



ij /dE{i,j)=d 



(where Nd is the number of nodes such that dEihj) = 
d) whose shape can help characterizing combined spatial 
and topological properties. 



d. Cost and EfRciency. The minimum number of links to 
connect N nodes is i? = — 1 and the corresponding 
network is a tree. We can also look for the tree which 
minimizes the total length given by the sum of the lengths 
of all links 



(40) 



where dsie) denotes the length of the link e. This proce- 
dure leads to the minimum spanning tree (MST) which 
has a total length i^^'^ (see also section IV.E). Obviously 
the tree is not a very efficient network (from the point 
of view of transportation for example) and usually more 
edges are added to the network, leading to an increase of 
accessibility but also to oi It- A natural measure of the 
'cost' of the network is then given by 



C 



gMST 



(41) 



Adding links thus increases the cost but improves ac- 
cessibility or the transport performance P of the network 
which can be measured as the minimum distance between 
all pairs of nodes, normalized to the same quantity but 
computed for the minimum spanning tree 



P 



MST) 



(42) 



Another measure of efficiency was also proposed in [208, 
209] and is defined as 



E = 



N{N 



—y— 



(43) 



where £{i,j) is the shortest path distance from i to j. 
This quantity is zero when there are no paths between 
the nodes and is equal to one for the complete graph (for 
which = 1). Combination of these different indi- 

cators and comparisons with the MST or the maximal 
planar network can be constructed in order to character- 
ize various aspects of the networks under consideration 
(see for example [66]). 

Finally, adding links improves the resilience of the net- 
work to attacks or dysfunctions. A way to quantify this is 
by using the fault tolerance (FT) (see for example [298]) 
measured as the probability of disconnecting part of the 
network with the failure of a single link. The benefit /cost 



ratio could then be estimated by the quantity FT/l^^^"^ 
which is a quantitative characterization of the trade-off 
between cost and efRciency [298]. 



— 3. Community detection. IVIotifs 



a. Community detection. Community detection in 
graphs is an important topic in complex network studies 
(see the review [129]), but after almost a decade of 
efforts, there is no definitive method of identification of 
communities, but instead many different methods with 
their respective advantage and drawbacks. 

Loosely speaking, a community (or a 'module') is a set 
of nodes which have more connections among themselves 
than with the rest of nodes. One of the first and simplest 
method to detect these modules is the modularity opti- 
mization and consists in maximizing the quantity called 
modularity defined as [246] 



(44) 



where the sum is over the hm modules of the partition, 
is is the number of links inside module s, i? is the to- 
tal number of links in the network, and ds is the total 
degree of the nodes in module s. The first term of the 
summand in this equation is the fraction of links inside 
module s and the second term represents the expected 
fraction of links in that module if links were located at 
random in the network (and by keeping the same degree 
distribution) . The number of modules hm is also a vari- 
able whose value is obtained by the maximization. If for 
a subgraph S of a network the first term is much larger 
than the second, it means that there are many more links 
inside S than one would expect by random chance, so S 
is indeed a module. The comparison with the null model 
represented by the randomized network is however mis- 
leading and small modules connected by at least a link 
might be seen as one single module. This resolution limit 
was demonstrated in [130] where it is shown that mod- 
ules of size VE or smaller might not be detected by this 
method. Modularity detection was however applied in 
many different domains and is still used. In the case of 
spatial networks, it is the only method which was used 
so far but it is clear that community detection in spa- 
tial networks is a very interesting problem which might 
receive a specific answer. In particular, it would be in- 
teresting to see how the classification of nodes proposed 
in [147] applies to spatial networks. 



b. Motifs. The motifs are particular subgraphs that are 
over-represented in the network with respect to an uncor- 
related random network with the same degree distribu- 
tion. The idea of motifs is particularly important for bi- 
ological networks [231] but can be applied to any type of 
networks, including spatial networks. In many networks 



found in biology, or ecology for example, a surprisingly 
small number of motifs exists and each type of network 
has its own characteristic set of motifs. The occurrence 
of a particular motif is characterized by its normalized Z- 
score, a measure for its abundance in the network. The 
similarity of the motif distributions between different net- 
works can be a signature of the ubiquity of simple pro- 
cesses occuring in human interactions, good exchanges, 
etc. 



III. EMPIRICAL OBSERVATIONS 

In this chapter we will review the most salient proper- 
ties of real-world spatial networks. We will insist on fea- 
tures connected to the spatial aspects of these networks 
and we will give references for other aspects. 

We will begin with transportation networks which are 
emblematic of spatial networks. Generally speaking, 
transportation networks are structures that convey en- 
ergy, matter or information from one point to another. 
They appear in a variety of different fields, including city 
streets [66, 73], plant leaves [273], river networks [269], 
mammalian circulatory systems [316], networks for com- 
modities delivery [134], technological networks [278], and 
communication networks [255]. The recent availability of 
massive data sets has opened the possibility for a quanti- 
tative analysis and modeling of these different patterns. 
In a second part, we review the recent results obtained 
for infrastructure networks such as roads, power grids or 
the Internet. In particular, we will also discuss the ge- 
ographical aspects of social networks. We then review 
the recent results on mobility networks which describe 
the statistics of human movements. We end this chap- 
ter with studies on neural networks, including the brain 
which is an important example of a complex network with 
a spatial component. Finally, we propose at the end of 
this chapter a recap under the form of a table (see table 
II). 

A. Transportation networks 

The paper of Watts and Strogatz [313] triggered many 
studies on networks and in particular motivated empir- 
ical analysis of various networks such as transportation 
networks. These networks control many aspect in our so- 
cieties and govern many modern problems such as disease 
spread, congestion, urban sprawl, structure of cities. In 
this section we review the main results concerning their 
structure. 



1. Representations 

In [203], Kurant and Thiran discuss very clearly the 
different possibilities to represent a transportation sys- 
tem (Fig. 6). Indeed, the simplest representation is ob- 
tained when the nodes represent the stations and links 
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FIG. 6 (a) Direct representation of the routes (here for three 
different routes), (b) Space-of-changes (sometimes called P 
space [207, 283, 286])). A link connects two nodes if there 
is at least one vehicle that stops at both nodes, (c) Space- 
of-stops. Two nodes are connected if they are consecutive 
stops of at least one vehicle, (d) Space-of-stations. Here two 
stations are connected only if they are physically connected 
(without any station in between) and this network reflects the 
real physical network. From [203]. 



the physical connections. One could however construct 
other networks such as the space-of-stops or the space- 
of-changes (see Fig. 6). One also finds in the literature 
on transportation systems, the notions of L and P-spaces 
[165, 283] where the L-space connects nodes if they are 
consecutive stops in a given route. The degree in L-space 
is then the number of different nodes one can reach within 
one segment and the shortest path length represents the 
number of stops. In the P-space, two nodes are connected 
if there is at least one route between them so that the de- 
gree of a node is the number of nodes that can be reached 
directly. In this P-space the shortest path represents the 
number of connections needed to go from one node to 
another. 



2. Airline networl<s 

The airline transportation infrastructure is an im- 
portant example of a spatial network. The nodes are 
identified to the airports which are located in a two- 
dimensional space. The location of the nodes are not 
uniformly distributed and are determined by exogenous 
factors. The links represent the existence of a direct fiight 
among them and obviously this network even if clearly 
spatial is not planar. 
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a. Weight and spatial heterogeneity. The usual character- 
istics of the world-wide air-transportation network using 
the International Air Transportation Association (lATA) 
database lATA ^ have been presented in [29]. The net- 
work is made of TV = 3880 vertices and E = 18810 
edges (for the year 2002) and displays both small-world 
and scale-free properties as also confirmed in different 
datasets and analysis [146, 149, 216]. In particular, the 
average shortest path length, measured as the average 
number of edges separating any two nodes in the net- 
work shows the value {£) = 4.37, very small compared 
to the network size TV but largely overestimated due to 
the presence of long paths between far remote areas (we 
can expect it to decrease if we weight the paths by their 
number of passengers). 



The degree distribution, takes the form 



(45) 



where 7 ~ 2.0 and f{k/k^) is an exponential cut-off func- 
tion. The degree distribution is therefore heavy-tailed 
with a cut-off that finds its origin in the physical con- 
straints on the maximum number of connections that 
a single airport can handle [14, 146, 149]. The airport 
connection graph is therefore a clear example of a spa- 
tial (non-planar) small-world network displaying a heavy- 
tailed degree distribution and heterogeneous topological 
properties. 

The world-wide airline network necessarily mixes dif- 
ferent effects. In particular there are clearly two different 
scales, global (intercontinental) and domestic. The in- 
tercontinental scale defines two different groups of travel 
distances and for the statistical consistency the study 
[31] focused on a single continental case with the North- 
American network constituted of TV = 935 vertices with 
an average degree (fc) « 8.4 and an average shortest path 
{£) Ri 4. The statistical topological properties of the 
North American network are consistent with the world- 
wide one: the degree is distributed according to a power 
law with exponent w 1.9 on almost two orders of magni- 
tude followed by a cut-off indicating the maximum num- 
ber of connections compatible with the limited airport 
capacity and the size of the network considered. The 
spatial attributes of the North American airport network 
are embodied in the physical spatial distance, measured 
in kilometers, characterizing each connection. Fig. 7 dis- 
plays the distribution of the distances of the direct flights. 
These distances correspond to Euclidean measures of the 
links and clearly show a fast decaying behavior reason- 
ably fitted by an exponential. The exponential fit gives a 
value for a typical scale of the order 1000 kms. The origin 
of the finite scale can be traced back to the existence of 
physical and economical restrictions on airline planning 
in a continental setting. In a recent paper [51] however 
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FIG. 7 Distribution of distances (in kms) between airports 
linked by a direct connection for the North American network. 
The straight line indicates an exponential decay with scale of 
order 1,000 kms. From [31]. 



Bianconi, Pin, Marsili study the connection probability 
Pij for two airports and found that for the US airline 
network (with iV = 675 airports and E = 3253 connec- 
tions) it behaves as a power law dependence of the form 
Pij cx dE{hj)~°' with a « 3 for d > 100 kms. Although 
the two quantities pij and P{d) are different {pij is the 
probability used to construct the network and P{d) is 
the resulting distribution obtained on the specific set of 
airports distributed across the US territory) it is unclear 
at this stage if these results are consistent. 

In addition these networks are weighted and the weight 
on each link represents the number of maximum passen- 
gers on the connection. The traffic at a node can then be 
studied by the weight strength (see section II.B.2.a) and 
behaves here as a power law of the degree 



(46) 



with an exponent ~ 1.7 (Fig. 8) demonstrating the ex- 
istence of strong correlations between the degree and the 
traffic. The distance strength (see the section II.B.2.a) 
also behaves as a power law of the degree 



\k) - k^" 



(47) 



^ http://www.iata.org. 



with (3d — 1.4 (Fig. 8). This result shows the presence 
of important correlations between topology and geogra- 
phy. Indeed, the fact that the exponents appearing in the 
relations Eq. (46) and Eq. (47) are larger than one have 
different meanings. While Eq. (46) means that larger air- 
ports have connections with larger traffic, Eq. (47) im- 
plies that they have also farther-reaching connections. 
In other terms, the traffic and the distance per connec- 
tion is not constant but increases with k. As intuitively 
expected, the airline network is an example of a very 
heterogeneous network where the hubs have at the same 
time large connectivities, large weight (traffic) and long- 
distance connections [29], related by super- linear scaling 
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FIG. 8 Weight and distance strengths versus degree for the 
North American network. The dashed lines correspond to the 
power-laws with exponents /3d ~ 1.4 and [3^ — 1.7. From [31]. 
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FIG. 9 Assortativity and clustering for the North American 
network. Circles correspond to topological quantities while 
the squares represent the weighted assortativity and cluster- 
ing. From [31]. 



b. Assortativity and Clustering. A complete characteri- 
zation of the network structure must take into account 
the level of degree correlations and clustering present in 
the netvifork. Fig. 9 displays for the North- American air- 
port network the behavior of these various quantities as 
a function of the degree. An essentially fiat knn{k) is 
obtained and a slight disassortative trend is observed at 
large k, due to the fact that large airports have in fact 
many intercontinental connections to other hubs which 
are located outside of North America and are not con- 
sidered in this 'regional' network. The clustering is very 
large and is slightly decreasing at large k. This behav- 
ior is often observed in complex networks and is here 
a direct consequence of the role of large airports that 
provide non-stop connections to different regions which 
are not interconnected among them. Moreover, weighted 
correlations are systematically larger than the topolog- 
ical ones, signaling that large weights are concentrated 
on links between large airports which form well inter- 
connected cliques (see also [29] for more details). 



c. Betweenness Centrality. As discussed in the section 
II.B.l.d, the betweenness centrality is a very interesting 
quantity which characterizes the importance of a node in 
the network. The more central the node and the larger 
the number of shortest paths going through this node, 
it is thus natural to observe strong correlations between 
the betweenness centrality and the degree. However for 
a lattice, the betweenness centrality is maximal at the 
barycenter of all nodes and we thus expect for real spa- 
tial networks an interesting interplay between the degree 
of the node and its distance to the barycenter. 

More precisely, it is generally useful to represent the 
average betweenness centrality for vertices of the same 



degree 

where N{k) is the number of nodes of degree k. For most 
networks, g{k) is strongly correlated with the degree k. 
In general, the larger the degree, the larger the central- 
ity. For scale-free networks it has been shown that the 
centrality scales with k as 

g{k) ^ fc" (49) 

where 77 depends on the network [35, 138]. For some net- 
works however, the betweenness centrality fluctuations 
around the behavior given by Eq. (49) can be very large 
and 'anomalies' can occur, in the sense that the variation 
of the centrality versus degree is not a monotonous func- 
tion. Guimera and Amaral [149] have shown that this 
is indeed the case for the worldwide air-transportation 
network. This is a very relevant observation in that very 
central cities may have a relatively low degree and vice 
versa. Fig. 10 displays the average behavior along with 
the scattered plot of the betweenness versus degree of all 
airports of the North American network. In this case we 
find also very large fluctuations with a behavior similar 
to that observed in [146]. 

These different observations call for the need of a net- 
work model embedded in real space which can reproduce 
these betweenness centrality features observed in real 
networks. More generally these different results point 
out the importance of space as a relevant ingredient in 
the structure of networks. In the chapter about mod- 
els (section IV) we focus on the interplay between spa- 
tial embedding, topology and weights in a simple general 
model for weighted networks in order to provide a mod- 
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FIG. 10 Scatter-plot of the betweenness centrality versus de- 
gree for nodes of the North American air-transportation net- 
work. The (red) squares correspond to the average between- 
ness centrality versus degree. From [31]. 



eling framework considering these three aspects at once. 



d. Communities. In [149], Guimera et al. used modu- 
larity optimization with simulated annealing (see section 
II. B. 3) in order to identify communities defined by the 
worldwide air network. The obtained result is shown in 




FIG. 11 Communities of the worldwide air transportation 
network obtained with modularity optimization. Each node 
represents an airport and each color corresponds to a commu- 
nity. From [149]. 

Fig. 11 where each color corresponds to a community. We 
immediately can see that most of these communities are 
actually determined by geographical factors and there- 
fore is not very informative: the most important flows 
are among nodes in the same geographical regions. More 
interesting are the spatial anomalies which belong to a 
community from the modularity point of view but which 
are in another geographical region. For example, the 'red' 
community of western Europe also contains airports from 
Asian Russia. More generally, it is clear that in the case 
of spatial networks, community detection offers a visual 
representation of large exchange zones. It also allows to 
identify the inter-communities links which probably play 



a very important role in many processes such as disease 
spread for example. 



3. Bus, subway, railway, and commuters 

a. Subways. One of the first studies (after the Watts- 
Strogatz paper) on the topology of a transportation net- 
work was proposed by Latora and Marchiori [207] who 
considered the Boston subway network. It is a relatively 
small network with N = 124 stations. The average short- 
est path is (^) ^ 16 a value which is large compared 
to In 124 « 5 and closer to the two-dimensional result 
7124 ~ 11. 

In [286] Sienkiewicz and Holyst study a larger set made 
of public transportation networks of buses and tramways 
for 22 Polish cities and in [305] von Ferber et al. study the 
public transportation networks for 15 world cities. The 
number of nodes of these networks varies from = 152 
to 2811 in [286] and in the range [1494,44629] in [305]. 
Interestingly enough the authors of [286] observe a strong 
correlation between the number of stations and the pop- 
ulation which is not the case for the world cities studied 
in [305] where the number of stations seems to be in- 
dependent from the population^. For the polish cities 
the degree has an average in the range [2.48, 3.08] and in 
a similar range [2.18,3.73] for [305]. In both cases, the 
degree distribution is relatively peaked (the range of vari- 
ation is usually of the order of one decade) consistently 
with the existence of physical constraints [14]. 

Due to the relatively small range of variation of A^ in 
these various studies [207, 286, 305], the behavior of the 
average shortest path is not clear and could be fitted by 
a logarithm or a power law as well. We can however note 
that the average shortest path is usually large (of order 
10 in [286] and in the range [6.4, 52.0] in [305]) compared 
to InA^, suggesting that the behavior of {£) might not 
be logarithmic with A^ but more likely scales as N^^^, a 
behavior typical of a two-dimensional lattice. 

The average clustering coefficient (C) in [286] varies 
in the range [0.055,0.161] and is larger than a value of 
the order Cer ~ 1/-^^ ~ 10^'^ — 10^^ corresponding to a 
random ER graph. The ratio (C) /Cer is explicitly con- 
sidered in [305] and is usually much larger than one (in 
the range [41, 625]). The degree-dependent clustering co- 
efficient C(fc) seems to present a power law dependence, 
but the fit is obtained over one decade only. 

In another study [279], the authors study two urban 
train networks (Boston and Vienna which are both small 
A^ = 124 and A^ = 76, respectively) and their results are 
consistent with the previous ones. 

Due to the limited availability of data, there are less 
studies on the passenger flows and the data so far seem to 



^ Incidentally, connecting socio-economical indicators to proper- 
ties of these networks is certainly a very interesting and difficult 
question which probably will be addressed in the future. 
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indicate that the flow is broadly distributed [211, 270] a 
feature which seems to be common to many transporta- 
tion systems (see Table II). 



b. Rail. One of the first studies of the structure of rail- 
way network [283] concerns a subset of the most impor- 
tant stations and lines of the Indian railway network and 
has N = 587 stations. In the P-space representation (see 
section III.A.l), there is a link between two stations if 
there is a train connecting them and in this representa- 
tion, the average shortest path is of order {£) « 5 which 
indicates that one needs 4 connections in the worst case 
to go from one node to another one. In order to obtain 
variations with the number of nodes, the authors con- 
sidered different subgraphs with different sizes N. The 
clustering coefficient varies slowly with TV is always larger 
than w 0.7 which is much larger than a random graph 
value of order 1/A^. Finally in this study [283], it is shown 
that the degree distribution is behaving as an exponen- 
tial and that the assortativity (/c„„)(k) is flat showing an 
absence of correlations between the degree of a node and 
those of its neighbors. 

In [203], Kurant and Thiran study the railway system 
of Switzerland and major trains and stations in Europe 
(and also the public transportation system of Warsaw, 
Poland). The Swiss railway network contains N = 1613 
nodes and E — 1680 edges (Fig. 12). All conclusions 




FIG. 13 EU rail dataset. (a) Physical layout of the network 
showing node of size proportional to (a) the degree, (b) the 
node betweenness centrality , (c) the real load. In (d, top) 
the degree versus the centrality is shown and in (d, bottom) 
the betweenness versus the real load. [202] . 

indicators is poor (with a Pearson correlation coefficient 
equal to 0.5 for betweenness centrality— fc and 0.26 for 
the correlation betweenness centrality-real load). As ex- 
pected from general considerations developed in the sec- 
tion II.B.l.d, we see large fluctuations in the relation k 
and the betweenness centrality, but surprisingly here, the 
degree seems to be a better indicator of the real load than 
the betweenness centrality [202]. 

c. Urban commuters network. As discussed above, it is 
important to know how the flows of individuals are struc- 
tured in a dense urban area (Theoretical discussions 
about this problem can be found in the section III.C.3 
about the so-called gravity law). In [82], Chowell et al. 
study the simulated movements of 1.6 million individu- 
als during a typical day in the city of Portland (Oregon, 
USA) and construct a network where the nodes represent 
the locations (buildings, homes, etc) and where the edges 
represent the flow of individuals between these locations 
(see Fig. 14). The distribution of the outdegree (which 




FIG. 12 Physical map of the Swiss railway networks. 
[203]. 
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drawn here are consistent with the various cases pre- 
sented in this chapter. In particular, the average degree 
is (k) « 2.1, the average shortest path is ~ 47 (consis- 
tent with the \/N result for a two-dimensional lattice), 
the clustering coefficient is much larger than its random 
counterpart, and the degree distribution is peaked (ex- 
ponentially decreasing). 

Finally, in [202] Kurant and Thiran studied networks 
by decomposing it into layers (see Fig. 13). In particu- 
lar, they compare the spatial distribution of the degree, 
the node betweenness centrality and the real load. As 
shown in Fig. 13d the correlation between these different 
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FIG. 14 Construction of the network between the locations 
in Portland as reported in [82]. The weight Wij on the link 
i — j represents the daily traffic from i to j. After [82]. 

is the number of edges leaving a node) , and of the corre- 
sponding out-traffic, are well-fitted by power laws. The 
clustering coefficient for different types of locations are 
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all large and when all activities are aggregated, the corre- 
sponding clustering coefficient behaves as 1/fc (Fig. 15). 
It thus seems that at this relatively small scale, space 
is not constraining enough (or in other words the cost 
variations are too small) and the important features of 
spatial networks do not seem to appear here. 
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FIG. 15 Loglog plots of the clustering coefScient C(fc) as a 
function of the out-degree for subnetworks constructed from 
work activities, school activities, social activities, and all the 
activities grouped together (the dotted line has slope 1). For 
social and recreational activities C{k) ~ 1/fc but for the net- 
works constructed from work activities, the clustering coeffi- 
cient is almost constant. From [82]. 

At a larger scale, de Montis et al. [107] studied the net- 
work representing the interurban commuting traffic of the 
Sardinia island (Italy), which amounts to 375 municipal- 
ities and 1.6 millions inhabitants. The nodes correspond 
to towns and the edges to the actual commuting flows 
among those. This network has a peaked distribution of 
degrees and similarly to the urban case studied in [82], 
the flows are very heterogeneous and can be fitted by a 
power law with exponent ss 1.8. In contrast with [82] 
where the relation between the strength and the degree 
is linear, it is strongly non-linear with /3 1.9 in this 
inter-urban case. 

Still in the case of commuters, a recent study [75] 
used modularity optimization to determine communities 
from this inter-urban commuters weighted network and 
showed that there is a consistency between the obtained 
communities and the administrative regions. A similar 
result was obtained using bank notes dispersal in [300]. 
Community detection was also used on the Belgian mo- 
bile phone network to identify language communities [53] . 
Community detection thus appears as an interesting tool 
for defining scientifically administrative boundaries and 
more generally for policy making and planning. 



4. Cargo-ship 

The last example of transportation network that we 
will discuss in this chapter is the cargo ship network [165, 



185, 324]. This network is particularly important as more 
than 90% of the world trade is carried by the sea. 

Cargo ships are not all the same and can be either bulk 
dry carriers, oil tankers or can be designed to transport 
containerized cargo. These different cargo types define 
different networks which can display different properties 
[185]. In the case of containerized cargo, many impor- 
tant statistical parameters were measured in [165] and 
we summarize here their results. The dataset consists in 
N = 878 ports and 1802 different lines giving 7955 edges. 
In the L-space representation (see section III.A.l), the 
nodes are the ports and there is a link between two ports 
when they are connected by a direct route. The short- 
est path between two ports in this representation thus 
represents the number of stops needed to travel between 
these two nodes. The average degree of the undirected 
version of this network is (k) « 9, the average cluster- 
ing coefficient is (C) « 0.40 which is much larger than 
1/878 ~ 10~^. The average shortest path is {£) w 3.6 and 
the out-, in- and undirected degree distributions seem 
to be relatively broad with maximum values as large as 
k ~ 10^. More clearly the weight (defined as the num- 
ber of container lines for a given time period) and the 
strength (see section ??) are broadly distributed. The 
power law fit proposed in [165] gives a behavior for the 
weight P{w) ~ 1/w (for w < 100, see Fig. 16). The 
strength (see II. B. 2) versus the degree display a nonlin- 
ear behavior (Fig. 16) typical of these networks. The 
exponent is of order /3 ~ 1.3 not far from the values 
/3 ~ 1.4 — 1.5 measured for the worldwide network. 

For this weighted network, weighted clustering and 
weighted assortativity were also measured [165] showing 
that C""(fc) > C{k) and kl^^{k) > fc„„(fc) which is the 
typical sign that most of the traffic takes place on link 
connecting various hubs. Also this network is weakly as- 
sortative with an almost flat fc„„(fc). 

The behavior of the betweenness centrality with the 
degree is interesting and is shown in Fig. 17 where we 
see that anomalies are very rare: the larger the degree 
and the larger the betweenness centrality. This particu- 
lar feature (which makes this network very different from 
the worldwide air travel network) might come from the 
fact that geographical constraints are less severe for cargo 
ships. We saw that the existence of anomalies could be 
linked to the fact that long links are difficult to create 
and must be compensated by a large degree. Longer 
links here are maybe less costly and constraints there- 
fore smaller leading to less anomalies and to a behavior 
closer to a power law {g{k)) ^ k^ typical of standard 
non-spatial networks. 

Recently, in [185] Kaluza et al. studied a larger dataset 
(obtained for the year 2007) with 490, 517 non-stop jour- 
neys between 36, 351 distinct pairs of departure and ar- 
rival ports. They also found a small average shortest 
path {£) « 2.5 with more than 50% of possible origin- 
destination pairs that can be connected by two steps or 
less. This study conflrms the facts that the average clus- 
tering is large, and that the degree, weight, and strength 
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FIG. 16 (a) Weight distribution for the worldwide maritime 
network (the power law fit gives an exponent ~ 0.95). (b) 
Strength versus degree showing a nonlinear behavior (the fit 
gives an exponent ~ 1.3). From [165]. 
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FIG. 17 Betweenness centrality versus the degree k. The 
power law fit for the L-space gives 1.66 and 1.93 in the P- 
space. From [165]. 



distributions are large. The exponent of the weight (de- 
fined here as the available space on all ships) distribution 
is equal to « 1.7 and is different from the study [165]. 

The weight is also very asymmetric (ie. Wij ^ Wji) 
with 59% of all linked pairs which exist in one direction 
only. This feature can also be observed in other trans- 



portation networks (such as mail or freight [96]) and is 
probably the sign of distribution networks as opposed to 
travel networks where basically every individual performs 
a roundtrip implying symmetrical weights Wij « Wji. 

The strength in [185] also scales as a power law of 
the degree with an exponent ~ 1.46 not far from the 
values of the exponents found in other networks. Fi- 
nally the betweenness centrality scales well with the de- 
gree and displays very few anomalies in agreement with 
[165]. After these rather standard measurements, the au- 
thors of [185] proposed other measurements. They first 
characterized the community structure (Figure 18) for 
three subnetworks of this cargo network obtained for dif- 
ferent ship types (containers, bulk dry carriers, and oil 
tankers). These communities were obtained by optimiz- 




FIG. 18 Communities of ports obtained for the cargo net- 
works obtained for different types of ships: (a) container 
ships, (b) bulk dry carriers, and (c) oil tankers. The optimal 
values of the number of communities c and the modularity 
Q are shown in the bottom right corner of each panel. From 
[185]. 

ing the modularity Q (see section II. B. 3). This procedure 
might miss some small structures [130] but at large scale 
can be trusted. Indeed in this case, groups are observed 
which are geographically consistent demonstrating the 
relevance of this method in this case. One could even 
think that community detection might be an important 
tool in geography and in the determination of new ad- 
ministrative or economical boundaries. 

Finally, the authors of [185] studied the statistical oc- 
currence of motifs, a method developed in a biological 
context [231]. In Fig. 19, we reproduce the motif dis- 
tribution for the three main cargo fleets compared with 
the results for the Web and social networks displaying 
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FIG. 19 Z-score for the different motifs shown at the bottom 
of the figure. Results are shown for the three main cargo 
fleets and for comparison with the WWW and social networks. 
From [185]. 



surprising similarities pointing to the existence of 'supe- 
runiversal' features in complex networks. 



B. Infrastructure networks 

Our modern societies depend crucially on infrastruc- 
ture networks. Movement of people on roads and streets, 
water and gas distribution, power grids, and communi- 
cation networks are all networks vital to the good func- 
tioning of the societies. In this chapter, we will review 
the most important and recent results on their structure 
in relation with spatial aspects. 



1. Road and street networks 

Despite the peculiar geographical, historical, social- 
economical mechanisms that have shaped distinct urban 
areas in different ways (see for example [214] and refer- 
ences therein), recent empirical studies [42, 73, 84, 102, 
175, 176, 206, 220] have shown that, at least at a coarse 
grained level, unexpected quantitative similarities exist 
between road networks of very different cities. The sim- 
plest description of the street network consists of a graph 
whose links represent roads, and vertices represent roads' 
intersections and end points. For these graphs, links in- 
tersect essentially only at vertices and are thus planar."^ 
Measuring spatial properties of cities through the anal- 
ysis of the street network is not new and was popular- 



^ For roads, highways, etc. planarity can be violated due to bridges 
but can be considered as a good approximation [206]. 



ized by Hillier and Hanson [161] under the term 'space 
syntax'. In this chapter, we will discuss different recent 
measures of these networks in the light of our current 
understanding of the structure of networks. 



a. Degrees, lengths, and cell areas. In [66, 73] measure- 
ments for different cities in the world are reported. Based 
on the data from these sources, the authors of [39] plot- 
ted (Fig. 20a) the number of roads E (edges) versus 
the number of intersections N. The plot is consistent 
with a linear fit with slope ~ 1.44 (which is consistent 
with the value (k) « 2.5 measured in [66]). The quan- 
tity e = E/N — {k)/2 displays values in the range 
1.05 < e < 1.69, in between the values e = 1 and 
e = 2 that characterize tree-like structures and 2d regu- 
lar lattices, respectively. In a study of 20 German cities, 
Lammer et al. [206] showed that most nodes have four 
neighbors (the full degree distribution is shown in Fig. 22) 
and that for various world cities the degree rarely exceeds 
5 [73]. These values are however not very indicative: pla- 
narity imposes severe constraints on the degree of a node 
and on its distribution which is generally peaked around 
its average value. Few exact values and bounds are avail- 
able for the average degree of classical models of planar 
graphs. In general it is known that e < 3, while it has 
been recently shown [137] that e > 13/7 for planar Erdos- 
Renyi graphs ([137] and section IV. B). 

In Fig. 20b, we can see that the total length £t of the 
network versus N for the towns considered in [73]. Data 
are well fitted by a power function of the form 



(50) 



with fj, « 1.51 and /? « 0.49. In order to understand 
this result, one has to focus on the street segment length 
distribution P{ii). This quantity has been measured for 
London in [224] and is shown in Fig. 21. This figure 
shows that the distribution decreases rapidly and the fit 
proposed by the authors of [224] suggests that 



(51) 



with 7 ~ 3.36 which implies that both the average and 
the dispersion are well-defined and finite. If we assume 
that this result extends to other cities, it means that we 
have a typical distance £i between nodes which is mean- 
ingful. This typical distance between connected nodes 
then naturally scales as 
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where p = N/E^ is the density of vertices and L the 
linear dimension of the ambient space. This implies that 
the total length scales as 
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presented so far, at this stage, is that the road network 
is an homogeneous and translational invariant structure. 
However, this network naturally produces a set of non 
overlapping cells, encircled by the roads themselves and 
covering the embedding plane, and surprisingly, the dis- 
tribution of the area A of such cells measured for the city 
of Dresden in Germany (Fig. 22) has the form 

P{A) - A""' (54) 

with a ~ 1.9. This is in sharp contrast with the simple 
picture of an almost regular lattice which would predict 
a distribution P{A) peaked around i^. It is interesting to 
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FIG. 20 (a) Numbers of roads versus the number of nodes 
(ie. intersections and centers) for data from [73] (circles) and 
from [66] (squares). In the inset, we show a zoom for a small 
number of nodes, (b) Total length versus the number of nodes. 
The line is a fit which predicts a growth as y/N (data from 
[73] and figures from [39]). 
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FIG. 22 (a) Degree distribution of degrees for the road net- 
work of Dresden, (b) The frequency distribution of the cells 
surface areas Ac obeys a power law with exponent a ~ 1.9 
(for the road network of Dresden). From [206]. 

note that if we assume that ^ ^ 1 /£l ^ 1/p and that p is 
distributed according to a law f{p) (with a finite /(O)), 
a simple calculation gives 
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FIG. 21 Length distribution P{ii) for the street network of 
London (and for the model GRPG proposed in this study). 
From [224]). 



This simple argument reproduces well the v N behavior 
observed in Fig. 20 and also the value (given the error 
bars) of the prefactor /i w {k)k/2. 

The simplest hypothesis consistent with all the data 



PiA) ^ -^/(IM) 



(55) 



which behaves as P{A) ~ 1/A'^ for large A. This simple 
argument thus suggests that the observed value ~ 2.0 of 
the exponent is universal and reflects the random vari- 
ation of the density. More measurements are however 
needed at this point in order to test the validity of this 
hypothesis. 

The authors of [206] also measured the distribution of 
the form factor (p = 4A/(7rZ?^), (the ratio of the area of 
the cell to the area of the circumscribed circle) and found 
that most cells have a form factor between 0.3 and 0.6, 
suggesting a large variety of cell shapes, in contradiction 
with the assumption of an almost regular lattice. These 
facts thus call for a model radically different from simple 
models of regular or perturbed lattices. 

Finally, we note that the degree distribution in the 
dual representation has been studied [184] for the road 
network in the US, England, and Denmark and displays 
broad fluctuations with a power law distribution with 
exponent 2.0 < 7 < 2.5. 



b. Betweenness centrality. The importance of a road can 
be characterized by its traffic which can be measured by 
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sensors (see for example [277])'*. If we assume that the 
traffic between all pairs of nodes is the same a natural 
proxy for the traffic is the betweenness centrality. Even 
if the underlying assumptions are not correct the spatial 
distribution of the betweenness centrality gives impor- 
tant information about the coupling between space and 
the structure of the road network. 

Lammer et al [206] studied the German road net- 
work and obtained very broad distributions of between- 
ness centrality with a power law exponent in the range 
[1.279,1.486] (for Dresden « 1.36). These broad distri- 




FIG. 23 Betweenness centrality for the city of Dresden. The 
width of the links corresponds to the betweenness centrality. 
From [206]. 

butions of the betweenness centrality signals the strong 
heterogeneity of the network in terms of traffic, with the 
existence of a few very central roads which very probably 
points to some congestion traffic problems. Also the ab- 
sence of a scale in a power law distribution suggests that 
the importance of roads is organized in a hierarchical 
way, a property expected for many transportation net- 
works [327]. The broadness of the betweenness centrality 
distribution does not seem however to be universal. In- 
deed, in [102, 275], the betweenness centrality distribu- 
tion is peaked (depending on the city either exponentially 
or according to a Gaussian) which signals the existence 
of a scale and therefore of a finite number of congested 
points in the city. The betweenness centrality is in itself 
interesting since it points to the important zone which 
potentially are congested. The Figs. 23,24 display the 
spatial distribution of the betweenness centrality for var- 
ious cities. As expected zones which are central from a 
geographical point of view also have a large betweenness 
centrality. We however see that other roads or zones can 
have a large betweenness centrality pointing to a complex 
pattern of flow distribution in cities. 

In addition to have a relation with the traffic and pos- 



The traffic between two zones given by the origin-destination ma- 
trix (see section III.C) is however much more difficult to obtain. 
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FIG. 24 Colour-coded maps for different centralities in 
Venice, Italy (see [102]). (1) Closeness, (2) Betweenness cen- 
trality, (3) Straightness, (4) Information (from blue to red the 
centrality increases). From [102]. 



sibly the congestion, a recent paper [294] proposes an in- 
teresting direction which is in the general context of con- 
necting topological measures of the networks and socio- 
economical indices. In particular, Strano et al. show 
that there is a clear correlation between the betweenness 
centrality and the presence of commercial activities. 



c. Other measures. Buhl et al. [66] measured different 
indices for 300 maps corresponding mostly to settlements 
located in Europe, Africa, Central America, India. They 
found that many networks depart from the grid structure 
with an alpha index (or Meshedness) usually low. For 
various world cities, Cardillo et al. [73] found that the 
alpha index varies from 0.084 (Walnut Creek) to 0.348 
(New York City) which reflects in fact the variation of 
the average degree. Indeed for both these extreme cases, 
using Eq. (31) leads to unyc ^ (3.38 -2)/4 ~ 0.345 and 
for Walnut Creek awe - (2.33-2) /4 ~ 0.083. This same 
study seems to show that triangles are less abundant than 
squares (except for cities such as Brasilia or Irvine). 

Measures of efficiency are relatively well correlated 
with the alpha index but displays broader variations 
demonstrating that small variations of the alpha index 
can lead to large variations in the shortest path struc- 
ture. Cardillo et al plotted the relative efficiency (see 
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section II.B.2.d) 
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versus the relative cost 
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where GT refers to the greedy triangulation (the max- 
imal planar graph) and MST to the minimal spanning 
tree. The cost is here estimated as the total length of 
segments C = £t (see section II.B.2.d). The obtained re- 
sult is shown in Fig. 25 which demonstrates two things. 
First, it shows - as expected - that efficiency is increas- 
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FIG. 25 Relative efficiency versus relative cost for 20 different 
cities in the world. In this plot the point (0, 0) corresponds 
to the MST and the point (1, 1) to the greedy triangulation. 
From [73]. 

ing with the cost with an efficiency saturating at ~ 0.8. 
In addition, this increase is slow: typically, doubling the 
value of C shifts the efficiency from ^ 0.6 to ~ 0.8. Sec- 
ond, it shows that most of the cities are located in the 
high cost-high efficiency region. New York City, Savan- 
nah and San Francisco have the largest value of the effi- 
ciency (~ 0.8) with a relative cost value around ^ 0.35. 
It seems however at this stage difficult to clearly identify 
different classes of cities and further studies with a larger 
number of cities is probably needed in order to confirm 
the typology proposed in [73]. 

Finally, we mention the recent study [99] on the street 
and subway networks of Paris and London. The acces- 
sibility in these cities is studied in terms of self-avoiding 
random walks displaying several differences. In partic- 
ular, Paris seems to have a larger average accessibility 
than London, probably due to a large number of bridges. 



2. Power grids and water distribution networks 

Power grids are one of the most important infrastruc- 
ture in our societies. In modern countries, they have 



evolved for a rather long time (sometimes a century) and 
are now complex systems with a large variety of elements 
and actors playing in their functioning. This complexity 
leads to the relatively unexpected result that their ro- 
bustness is actually not very well understood and large 
blackouts such as the huge August 2003 blackout in North 
America call for the need of a better understanding of 
these networks. 

The topological structure of these networks was stud- 
ied in different papers such as [9, 14, 288]. In particular, 
in [14] and [9], the authors study the Southern Califor- 
nian and the North American power grids. In these net- 
works the nodes represent the power plants, distributing 
and transmission substations, and the edges correspond 
to transmission lines. These networks are typically pla- 
nar (see for example the Italian case. Fig. 26) and we 
expect a peaked degree distribution, decreasing typically 
as an exponential of the form P{k) ~ exp(— fc/(fc)) with 
(k) of order 3 in Europe and 2 in the US. The other 




FIG. 26 (a) Map of the Italian power grid, (b) Topology of 
the Italian power grid, (c) Degree distribution for the Euro- 
pean network (UCTE), Italy and the UK and Ireland. In all 
cases the degree distribution is peaked and can be fitted by 
exponential. From [288]. 



studies on US power grids confirm that the degree distri- 
bution is exponential (see Fig. 27). In [9], Albert, Albert, 
and Nakarado also studied the load (a quantity similar to 
the betweenness centrality) and found a broad distribu- 
tion. The degree being peaked we can then expect very 
large fluctuations of load for the same value of the degree, 
as expected in general for spatial networks (see section 
II.B.l.d). These authors also found a large redundance 
in this network with however 15% of cut-edges. 

Also, as expected for these networks, the clustering 
coefficient is rather large and even independent of k as 
shown in the case of the power grid of Western US (see 
Fig. 28). 

Beside the distribution of electricity, our modern so- 
cieties also rely on various other distribution networks. 
The resilience of these networks to perturbations is thus 
an important point in the design and operating of these 
systems. In [326], Yazdani and Jeffrey study the topo- 
logical properties of the Colorado Springs Utilities and 
the Richmond (UK) water distribution networks (shown 
in Fig. 29). Both these networks (of size N = 1786 and 
iV = 872, respectively) are sparse planar graphs with 
very peaked distributions (the maximum degree is 12). 
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FIG. 27 Degree distribution of substations in Southern Cal- 
ifornia (top panel) and for the North American power grid 
(bottom panel). In both cases, the lines represent an expo- 
nential fit. From [14] and [9], respectively. 
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FIG. 28 Scaling of the clustering C{k) for the Power Grid 
of the Western United States. The dashed line has a slope 
— 1 and the solid line corresponds to the average clustering 
coefficient. From [263]. 
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FIG. 29 Representation of water distribution networks. Left 
panels (from top to bottom): Synthetic networks ('Anytown' 
[307], and 'EXNET' [127]). Top-right panel: Colorado Spring 
Utilities network. Bottom-right panel: Richmond (UK) water 
distribution network. From [326]. 



3. The Internet 

The Internet is the physical layout for the WWW 
which organizes the information and makes it accessi- 
ble. At a small scale, the Internet is a network of routers 
connected by wires. At a larger scale, the routers are 
grouped according to an administrative authority, the 
autonomous system (AS) and at this level, the Internet 
can be seen as a set of interconnected AS. 

There is obviously a very large literature in computer 
science (but also in statistical physics [255]) about the 
Internet and its structure. Here we will briefly give some 
characteristics of this important network that we believe 
are related to the spatial aspects of the Internet. 

The nodes of this network (routers) are distributed 
among different countries and one expects a non-uniform 
distribution depending on socio-economical factors. In- 
deed, this distribution as been shown in [329] to form a 
fractal set of dimension df ~ 1.5 strongly correlated with 
the population density.^ Concerning the links, there is 
an obvious cost associated with their length and one ex- 
pects a rapidly decreasing length distribution. In most 
models (in particular for the Waxman model [314], see 
also section IV. B. 4), the link distribution is assumed to 
be an exponential of the form eT'^l'^o i^m^ ^j^g authors of 
[329] seemed to observe results consistent with a slower 



The fractal dimension is here the usual one- see for example the 
book [69] and is not the fractal dimension that could be defined 
for a network [289]. 
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decay P{d) ~ 1/d over one decade. 

The clustering coefficient for the Internet is maybe 
more interesting. At the AS level, it varies [263] as 
C{k) ^ 1/k, a behavior typical of many scale-free net- 
works [113]. However in sharp contrast with this result, 
if we do not aggregate the nodes and stay at the router 
level, we observe [263], a clustering coefficient approxi- 
mately constant (Fig. 30). 




k 



FIG. 30 Scaling of the clustering C(fc) for the Internet at the 
router level. The dashed line has a slope —1 and the solid line 
corresponds to the average clustering coefficient. From [263]. 

All these different results suggest that the location of 
nodes is given exogenously and should be the input of a 
realistic model. Once the location of the nodes given, the 
link length distribution and the behavior of the clustering 
coefficient suggest that there is a competition between 
preferential attachment and a spatial dependence, model 
that we will discuss in the next chapter IV. B. 

4. Geography in social networks 

With the development of information technologies, we 
can now have an idea about some aspects of the struc- 
ture of social networks. In particular, as expected, space 
is also important in social networks. It is indeed reason- 
able to think that in order to minimize the efforts and to 
maintain a social tie [338] most individuals will connect 
with their spatial neighbors. As a measure of the social 
tie, Lambiotte et al. [205] used mobile phone data for 
3.3 millions customers in Belgium. For each customer, 
a certain number of attributes is known and in partic- 
ular the zip code of the address where the bill is sent. 
After some filtering procedures, the resulting network is 
made of 2.5 millions nodes (customers) and 5.4 millions 
links, giving an average degree (k) = 4.3. The degree dis- 
tribution is peaked (the power law fit proposed in [205] 
gives an exponent 7 « 5) and the probability P{d) that 
two connected individuals are separated by an euclidean 
distance d is found to behave as [205] 

P{d) ~ d-^ (58) 



over distances in the range 1 — 100 kms. The exponent 
describing the decay with distance is thus equal to 2.0 in 
this example and supports the idea of a gravity law (see 
section III.C.3) even in the context of social ties. This 
result is also consistent with the measure obtained for 
inter-city phone intensity exponent (see [200] and section 
III.C.3.d). According to Kleinberg's navigability theo- 
rem [195], this result also shows that this social network 
is navigable (see section V.D for a discussion on naviga- 
bihty). 

However, another study [217] on a social network of 
more than one millions bloggers in the USA shows con- 
vincingly that the proportion of pairs of nodes at distance 
d which are friends decays as 

P{d) -e^d-" (59) 

where a « 1 and where e w 5 x 10^^. In addition, this 
exponent a of order one is confirmed in another studies 
[21, 140] on Facebook users and on email communica- 
tions. Interestingly, a recent paper [166] showed that 
this spatial scaling P{d) ~ 1/d could result from the fact 
that most individuals tend to maximize the diversity of 
their friendships (which can be seen as the maximization 
of information entropy). The plateau observed in [217] at 
e > shows that above a certain distance (here of the or- 
der 1, 000 kms), the probability P{d) flattens to a con- 
stant value independent of distance and which probably 
results from complex processes acting in social networks. 
For this network, on an average of eight neighbors, we 
then have on average 5.5 friends living in the proximity 
while only 2.5 result from non-geographic processes (for a 
discussion about navigability on this network, see section 
V.D. 6). This is consistent with the results of [140] where 
it is shown that about 40% of emails were sent within the 
same city (for this particular panel of individuals). 

Although there is no agreement so far (but we expect 
rapid developments on this topic) on the nature of the 
decay and the value of the exponent, the existence of 
an important spatial component in social networks is 
rather clear and at this point it seems that on average 
the majority of our friends are in our spatial neighbor- 
hood. This is confirmed in a recent paper [276] which 
measured a variety of parameters on four different on- 
line social networks (BrightKite, FourSquare, LiveJour- 
nal, Twitter) and showed in particular that there is a 
majority of short-distance links between users. This de- 
pendence with space seems to go even beyond friendship 
and applies to other social processes such as collaborat- 
ing and writing a paper together [77]. In fact, the strong 
association between friendship and spatial location led 
Backstrom, Sun, Marlow [21] to propose an algorithm 
-which exceeds IP-based geolocation- to predict the loca- 
tion of people in Facebook who didn't provide this infor- 
mation. 

Clustering seems also to be large as shown in [21, 205]. 
In particular, Lambiotte et al. [205] studied the proba- 
bility c{d) that a link of length d belongs to a triangle. 
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This probability decreases with distance and reaches a 
plateau at around 0.32 above 40 kms. It is interesting 
to note also that the average duration of a phone call 
depends also on the distance (see Fig. 31) and saturates 
to a value of about 4 minutes for the same value d > 40 
kms obtained for c{d). These results led these authors to 
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FIG. 31 Average duration td of a phone call as a function of 
the distance d. A saturation is observed for distances larger 
than 40 kms. From [205]. 

suggest that two communication regimes exist: a short 
distance 'face-to-face' regime with short communication 
durations and a large clustering coefficient and another 
long-distance regime with smaller clustering and longer 
communications . 



C. Origin-destination matrix and mobility networks 

1. Importance of human mobility 

Knowing the flows of individuals from one point to the 
other in a city or a country is a very important piece 
of information upon which many models and studies de- 
pend. First, it is related to the fundamental problem in 
geography and spatial economics of the location of activ- 
ities and their spatial distribution. Second, it is a key 
ingredient in the epidemiology of infectious diseases (see 
section V.F). These diseases transmit between humans 
in close proximity and contaminate the population be- 
cause they travel and interact. Clues on the statistics of 
humans movements and interactions is thus fundamental. 
At a global level for example, the spread of infectious dis- 
eases such as the flu is - in addition to seasonal effects - 
largely controlled by air travel patterns (see [91] and sec- 
tion V.F). Third, statistics about individual movements 
are important for more commercial applications such as 
geomarketing for example. Finding the best spot to place 
your advertisement depends on the number of people go- 
ing through this location and thus implies to know the 
flows, or at least the dominant flows of individuals in the 
area under consideration. 

A simple way to formalize this problem is to divide the 



area of interest in different zones labelled by z = I, . . . ,N 
and to count the number of individuals going from loca- 
tion i to location j. This number is called the origin- 
destination (OD) matrix and is denoted in this review 
by Tij . This quantity is at the core of many transporta- 
tion models and its properties have been discussed in a 
large number of papers. It defines a network, directed 
and weighted, and in the general case is time-dependent. 
This quantity is of course very different from a 'segment' 
measurement where we can easily count the number of 
individuals going through an individual segment of the 
transportation system under consideration (it can be one 
of the link of the airline network, a segment of road, etc.). 
In contrast, the OD matrix is usually extremely difficult 
and costly to obtain and measure. 

Despite all the difficulties in measuring the OD ma- 
trix, recent technological advances such as the GPS, the 
democratization of mobile phones together with geoso- 
cial applications, etc. allow for precise measurements on 
large datasets and point to the possibility to understand 
quantitatively urban movements. In the next sections we 
will then review some recent results in this field. 



2. Distribution of the trip duration and length 

The duration and length of trips in urban areas is ob- 
viously of great importance and in particular, daily trips 
can characterize the economical efficiency of an urban 
area. In particular the duration of daily journey-to-work 
trips give a good indication of the economical health and 
efficiency of a given region. For many different reasons, it 
is then important to understand the factors governing the 
behavior of the statistics of the daily trips. In particular 
in the perspective of convergence to sustainable cities, 
we need to understand such factors in order to reduce 
energy and environmental problems. We are now at the 
stage where always more data is available and it will soon 
become possible to characterize the statistics of human 
movements at many different scales and in particular at 
the urban scale. 

The important case of the statistics of the daily 
journey-to- work travel is an old problem and discussed 
at length in the transportation research literature. One 
can find various empirical observations for different ur- 
ban areas (see for example [233] a short review on this 
subject) and also some theoretical discussions (see for 
example [233]). An important theoretical discussion was 
triggered by Zahavi [330] who suggested that the aver- 
age daily travel time at the regional scale varies little 
over space and time. This hypothesis is interesting in 
the sense that it is the first step towards some 'physi- 
cal laws' ruling human movement. This hypothesis of 
a constant time (and money) budget can be rephrased 
as the 'rational locator hypothesis' states that individu- 
als maintain (if they can choose) steady journey-to-work 
travel times by adjusting their home-work distance (see 
for example [213]. 
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Koelbl and Helbing [196] proposed that the travel time 
can be understood in terms of a travel energy budget. 
More precisely, they found emprically that the average 
travel time for each daily transportation mode considered 
(walk, bicycle, car, bus, train) is approximately constant 
for over 27 years (the data considered was obtained for 
the UK in the period 1972 - 1998, see the end of this 
chapter for a discussion on this point). This result sug- 
gests that there is a travel time budget associated with 
each transportation mode and the main idea put forward 
in [196] is that the energy consumed by individuals for 
travel is constant and mode-independent 



E sa 



(60) 



where E w 615fcJ is the estimated average daily energy 
budget, ti the average travel time for mode i, and Pi 
the energy consumption rate for mode i (which varies 
from « 4fcJ/min for a train passenger to w 15/cJ/min for 
walking) . 

A further observation is that the individual travel time 
(for a mode i) rescaled by the average time of the corre- 
sponding mode Ti = t/ti follows a universal probability 
distribution which in terms of energy Ei — pit — Eti 
reads 



^-aE/Ei-l3Ei/E 



(61) 



where a and /3 are fitting parameters. This distribution 
reflects the variation between individuals but displays a 
universal behavior among modes. The collapse is shown 
in Figure 32. The exponential term e~^^*/^ corresponds 
to the canonical energy distribution when the average 
energy is fixed (and can be recovered by maximizing the 
entropy with the normalization and the fixed average en- 
ergy constraints). The term exp(— a£^/i?i) suppresses 
the short trips refiecting what is called the Simonson ef- 
fect (see [196] and references therein) and leads to the 
selection of a less costly transportation mode. Indeed 
if the trip is too short for a given transportation mode, 
the energy spent for preparation can be larger than the 
energy needed for the trip itself and a less costly trans- 
portation mode is then more efficient. 

These results of Koelbl and Helbing have however been 
challenged in other empirical studies. In [213], Levinson 
and Wu showed that the average daily travel seems to 
be constant over time for a given city but varies from 
one city to another. The average travel time for a given 
city then depends on structural parameters character- 
izing the city (such as population, density, transporta- 
tion networks, congestion, etc). In another study [167] 
on three travel surveys (for France, UK, and Belgium), 
Hubert and Toint criticize the validity of the hypothe- 
sis of constant average daily human energy expenditure 
Eq. 60. However, they show that the distribution of the 
normalized travel time seems to be the same for the three 
different countries of this study pointing to the possible 
existence of universal features of human movement. 
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FIG. 32 Universal travel energy distribution in (a) linear rep- 
resentation and in (b) semilogarithmic representation. The 
solid line is the fit obtained for a — 0.2 and /3 — 0.7. From 
[196]. 



It is clear at this point that a 'unified theory' of hu- 
man travel behavior is still needed as well as other empir- 
ical studies, but the analysis presented in [196] at least 
shows that tools from statistical physics can play a role in 
transportation research and more generally in uncovering 
universal patterns in human behavior. 



a. Mobile phone and GPS studies. As discussed above, ob- 
taining OD matrices is a very difficult and crucial task. 
While traditional surveys are costly and incomplete, mo- 
bile phone data give more complete informations. In par- 
ticular, in large urban areas, the density of antennas is 
large enough so that triangulation gives a relatively ac- 
curate indication of the users' location (mobile devices 
such as phones are regularly in contact with emitters and 
triangulation allows to determine the location of the de- 
vice at a resolution scale given by the local density of 
emitters). For all these reasons, mobile phone data were 
recently used to detail individual trajectories [3], to iden- 
tify 'anchor points' where individuals spend most of their 
time [1, 118], or statistics of trip patterns [144, 285]. Re- 
sults and studies are so numerous now, that we will focus 
on a small subset only and refer the interested reader to 
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[285] and references therein. 

Gonzalez et al [144] used a set of mobile phone data 
at a national level and found that the distribution of 
displacement of all users can be fitted by a Levy law of 
the form 

where /? ~ 1.75, Arg ~ 1.5 km. The cutoff value k, 
depends on the protocol used and varies from k ~ 400 
kms (for protocol Di) and k ~ 80 kms for the protocol 
D2. The protocol Di consists in recording trajectories 
(of 100, 000 users) when the user initiates or receives a 
call or SMS. The protocol D2 captures the location of 
206 mobile phones every two hours for an entire week. 
The origin of the data is not indicated in this paper and 
in particular could mix both rural and urban areas which 
could be the cause of the observed behavior. However, 
this type of approach is very promising and will certainly 
lead in the future interesting insights about human move- 
ment behavior in various areas. 

We note that the measured value for the exponent /3 
is not far from the value 1.6 observed in [64]. In this 
study, Brockmann et al. used data for the movements of 
banknotes across the USA using the 'Where is George' 
database, a system based on voluntary reporting. They 
found that the distance distribution is also Levy dis- 
tributed. However, the presence of a given banknote at 
a given location and at a certain time depends on many 
processes such as the diffusion of the bills, but also on 
the probability to actually use a banknote, and of course 
to report its location to the Whereisgeorges.com website. 
Such crucial factors were not taken into account in [64] 
and it is therefore difficult to assess the validity of the 
obtained Levy law. 

We note here that in a recent paper [290], Song et al. 
proposed a model of human mobility in order to under- 
stand the power laws observed in [64, 144]. In particular, 
this model includes the two following ingredients in the 
description human displacements: First, the tendency to 
explore additional locations decreases with time, and sec- 
ond, there is large probability to return a location vis- 
ited before. This model gives relations between different 
scaling exponents and values which are consistent with 
empirical fits [290]. However, it is still unclear at this 
stage at which scale this study applies and how it can 
be reconciled with the previous studies showing a peaked 
distribution of travel times. 

GPS is another interesting tool in order to charac- 
terize individual trajectories. Recent studies [46, 262] 
used GPS data of private vehicles for the city of Florence 
(Italy), and have shown that the total daily length trip 
is exponentially distributed (see Fig. 33) 

P(^t) ~ e"*^^/^" (63) 

and seems to be independent of the structure of road 
network pointing to a general mechanism which need to 
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FIG. 33 Distribution of total length of daily trips. The expo- 
nential fits gives a slope Lq — 25kms. From [46]. 

be uncovered. In particular, the authors suggest a re- 
lation with this distribution and Maxwell distributions. 
This seems to be actually reminiscent of the general ar- 
gument proposed by Koelbl and Helbing (see [196] and 
previous chapter III.C.2) and indeed points to the possi- 
ble existence of more general principles governing human 
movements. 



b. RFIDs. At an intra-urban scale and even at the scale 
of social networks, RFIDs might provide interesting in- 
sights. RFID stands for Radio Frequency Identification 
and is a technology similar to the bar code and is com- 
posed of a tag which can interact with a transceiver that 
process all informations contained in the tag. RFIDs are 
used in many different instances from tagging goods to 
the Oyster card system in London. In fact it can be used 
anywhere a unique id is needed and the tag can carry 
simple or more complex information about the carrier. 
RFIDs can also be used for dynamical measures of so- 
cial networks such as in the sociopatterns project (see 
http : //www. sociopatterns . com). 

The Oyster card system in London provides informa- 
tion about instantaneous flows of individuals in the sub- 
way system. Since 2003, some 10 million RFID cards 
have been issued to commuters using the London trans- 
port network. In [270], these individual trajectories were 
analyzed, displaying evidence for a polycentric organiza- 
tion of activity in this urban area. These authors also 
found that in agreement with many other transportation 
networks, the traffic is broadly distributed (according to 
a power law with exponent sa 1.3) but also that the dis- 
placement length distribution is peaked (see Fig. 34). 

3. The gravity law 

The origin-destination matrix contains a large amount 
of information and allows to test some ideas about the 
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FIG. 34 Ride distance distribution, (a) Plot of the histogram 
of distances for observed rides. This distribution can be fitted 
by a negative binomial law of parameters r = 2.61 and p = 
0.0273, corresponding to a mean ^ = 9.28 kms and standard 
deviation a — 5.83 kms. This distribution is not a broad law 
(such as a Levy flight for example), in contrast with previous 
findings using indirect measures of movements [64, 144]. (b) 
Ride distance propensity. Propensity of achieving a ride at 
a given distance with respect to a null-model of randomized 
rides. From [270]. 



structure of human movements. In particular, it has been 
suggested (see for example the book [125]) more than 50 
years ago that the number of trips from location i to 
location j follows the 'Gravity' law 

T., = (64) 

where dij — dE{i,j) is the euclidean distance between 
these two locations, P^q) is the population at location i 
(j) and where a is an exponent whose value actually de- 
pends on the system. This idea was generalized to many 
other situations such as the important case in economics 
of international trade [16, 49]. In this case, the volume 
of trade between two countries is given in terms of their 
economical activity and their distance. 

More generally the gravity law (see the theoretical dis- 
cussion in the section III.C.S.c) is written under the form 

T,, ^P,P,/(d(z,j)) (65) 

where the deterrence function / describes the effect of 
space. 

In the next sections, we will focus on the most recent 
measures and concern highways [179], commuters [25], 
and cargo ship movements [185], and phone communica- 
tions [200]. We then end this chapter with a theoretical 



discussion on the gravity law. 

a. Worldwide commuters. Balcan et al. [25] studied re- 
cently flows of commuters at a global scale. They stud- 
ied more than 10^ commuting flow worldwide between 
subpopulations defined by a Voronoi decomposition and 
found that the best fit is obtained for a gravity law of the 
form 

T,j = CPf P/e~'''^/« (66) 

where C is a proportionality constant, and where the 
exponents are: for d < 300 kms, (a, 7) — (0.46,0.64), 
AC = 82 kms, and for d > 300 kms: (a, 7) ~ (0.35,0.37). 
We note an asymmetry in the exponent at small scales 
which probably reflects the difference between homes and 
offices and which does not hold at large scale where ho- 
mogenization seems to prevail. 

At this granularity level, there is then a dependence 
of the traffic on populations and distances with specific 
exponents and with exponentially decreasing deterrence 
function. At a smaller scale, different results for US com- 
muters were obtained in [304] and as suggested in [25] the 
observed differences might found their origin in the differ- 
ent granularities used in these studies (a problem known 
as the 'modifiable areal unit problem' in geography). In- 
deed, in [25], the granularity is defined by a Voronoi de- 
composition, while in [304], counties are used which are 
administrative boundaries not necessarily well spatially 
consistent with gravity centers of mobility processes. 

b. Korean highways. In [179], Jung, Wang, and Stanley 
studied the traffic on the Korean highway system for 
the year 2005. The system consists in 24 routes and 
238 exits, and the total length of the system is about 
3, 000 kms. The highway network is described by a sym- 
metrized weight matrix Tij which represents the traffic 
flow between i and j. The in- and out-traffic are well 
correlated with population, as already seen in the world- 
wide airline network [91] where the population Pi of city 
i scales with the strength Si as 

^ sf (67) 

with a « 0.5 while it is close to one in [107] and [179]. For 
30 cities with population larger than 200, 000, Jung et al. 
study the traffic flow Tij as a function of the population 
of the two cities P; and Pj , and the distance dij between 
i and j and used the original formulation of the gravity 
law 

w., ^ ^ (68) 
with (T = 2 (see Fig. 35). 
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FIG. 35 Traffic flow between i and j as a function of tlie 
variable PiPj/dij. The line has a slope equal to 1.02. From 
[179]. 



c. Global cargo ship. In [185], Kaluza et al. applied the 
gravity law to the global cargo ship network. They used 
the general form of the gravity law, where the number 
of trips Tij is given as (see the discussion in the section 
Ill.C.a.e) 



FIG. 36 (a) Histogram of port to port distances traveled in 
the global cargo ship network. The red line represents the 
gravity model, the blue line a random null model, (b) Count 
of port pairs with Nij observed and Fij predicted journeys. 
From [185]. 



(69) 



inter-city communication intensity follows a gravity law 



(70) 



where Oi is the total number of departures from node 
i and Ij the total number of arrivals at node j. The 

coefficients Ai and Bj are normalization prefactors such where Lij is the total communication time between cities 

Ij . i and j (for a period of six months) . This result showing 

a power law decay with exponent 2 is in agreement with 
other studies on the influence of space on social networks 
(see [205] and section III.B.4). More generally, going be- 
yond the topological structure of the social network and 
including other socio-economical parameters is probably 
a very interesting research direction which will be devel- 
oped in the future. 



that J2j Tij = Oi and Tjj 

The deterrence function used in [185] is a two param- 
eters truncated power law f{d) = d'"^ exp{—d/ k). The 
best correlation was obtained for k = 4, 900 kms and 
a = 0.59. As shown in Fig. 36, this fit compares ex- 
tremely well with the empirical data. In order to test the 
dispersion on every pairs of origin-destination, Kaluza et 
al. [185] plot the observed number Nij of trips between 
i and j with the prediction Fij obtained with the grav- 
ity model. A perfect agreement would give a diagonal 
Fij = Nij and we observe here some dispersion (with a 
small Kendall's tau of t = 0.433) showing that even if 
the gravity law is able to reproduce the histogram, it is 
not accurate enough in order to predict the actual flow 
between a given pair of origin and destination. 



d. Inter-city phone communications. Mobile phone data 
open a new way to analyze geographical structure at var- 
ious scales and in particular allows to exhibit spatial fea- 
tures of social networks. In a recent paper [200] , Krings et 
al. study the network of mobile phone customers and an- 
alyze the geographical pattern of 2.5 millions customers. 
After aggregating by city, these authors found that the 



e. Theoretical discussion. The problem of estimating the 
number of trips from one zone to another is an old prob- 
lem which excited the curiosity of many scientists a long 
time ago. It is reasonable to think that the number of 
trips between two locations i and j is proportional to the 
number of possible contacts given by PiPj and also de- 
creases with distance, or equivalently with time and more 
generally with the cost of travelling from i to j. Readers 
interested in theoretical developments of this model can 
find a full account in the book [125]. It is interesting to 
note that in this book devoted to the gravity model and 
which is one of the most cited reference in this field, all 
the chapters are devoted to theoretical and mathematical 
discussions without any attempt to compare the results 
with empirical observations. 
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There are obviously many factors which control the 
origin-destination matrix: land use, location of indus- 
tries and residential areas, accessibility, etc and it seems 
difficult to provide a general and simple enough model. 
The gravity law in its simplest form retains the popula- 
tion and the distance as relevant factors (more elaborated 
forms with up to dozen of parameters were used in prac- 
tical applications). However, given the importance of 
this law for many practical applications and its extensive 
use in simulations (in epidemiology for example), we be- 
lieve that an extensive meta-analysis for different modes, 
geographical regions and granularities is greatly needed. 
Thanks to the increasing availability of data, we can ex- 
pect in the future a more systematic investigation of this 
problem and we try to provide here a first step in this di- 
rection and we show in Table I a list of various (essentially 
recent) results obtained empirically. As we can observe 
in this table, there are multiple choices for the deterrence 
function and various values of the exponent as well. Par- 
ticularly puzzling is the existence of different values for 
the exponent a (which can vary from « 0.5 to « 3) and 
the different deterrence function for commuters. As sug- 
gested in [25] the observed differences might found their 
origin in the different granularities used in these studies. 
Indeed, in [25], the granularity is defined by a Voronoi de- 
composition, while in [304], counties are used which are 
administrative boundaries not necessarily spatially con- 
sistent with mobility processes (a problem known as the 
modifiable areal unit problem in geography). In addition, 
the different exponents could depend on the transporta- 
tion mode used, of the scale, or other effects linked to the 
heterogeneity of users and trips. 

In this short discussion, we thought that it could be 
useful to recall the classical optimization problem and 
one of the most important derivation of the gravity law 
which uses entropy maximization and also to give a sim- 
ple statistical argument which could shed light on the 
most important mechanisms in this problem. 

Optimization 

We first recall the classical approach which is at the 
basis of many studies (see for example [125]. We are 
interested in this problem in determining the OD matrix 
Tij given the constraints 

Y.^^J=T^ (71) 

i 

Y.T,,=T, (72) 

i 

These represent 2A^ constraints for iV^ unknowns and as 
long as iV > 2 many different choices for Tij are possible. 
If we assume that the transport from i to j has a cost 
Cij we can then choose Tij such that the total cost 

c = 5]r,,a, (73) 

ij 



is minimum. This is the classical transportation problem 
and can be traced back to the 18*'* century and Monge 
[125]. Another approach consists in requiring that — 
T^jViSj where T^ is a given set of interzonal weights and 
where Sj and are given constants. For an extensive 
discussion on this latter approach, see [125]. 

Entropy maximization 

Interestingly enough, the gravity model can be shown 
to result essentially from the maximization of entropy 
[318]. Wilson, a physicist who got interested in trans- 
portation research very early proposed that the trips Tij 
are such that the quantity 

T! 



is maximal which corresponds to trip arrangements with 
the largest number of equivalent configurations (or mi- 
crostates in the statistical physics language). In this ex- 
pression, T = '^ij Tij is the number of total trips and the 
maximization is subject to the natural constraints on the 
origin-destination matrix 

T.^^J^T, (75) 

j 

J2^^j=Tj (76) 

i 

and to a cost constraint 

^T,,Q,=C (77) 

where Cij is the cost to travel from i to j and where C 
is the total quantity of resources available. This max- 
imization is easy to perform with the help of Lagrange 
multipliers and one obtains 

T,;, = A,B^T,T^e-^^^' (78) 

where Ai, Bj and /3 are such that the constraints are 
met. Of course, even if in this expression we do not have 
explicit expressions of Ai and Bj, the most important 
problem at this point is how to express the cost. In order 
to recover a power law distribution, one needs a logarith- 
mic dependence on distance: Cij = alnd^(i,j) which 
leads to Tij cx dsii, ■ If the cost is proportional to 
distance, the number of trips decays exponentially with 
distance. We thus recover two of the most important 
forms used in empirical studies and in model, but the 
exact form of the cost dependence with distance remains 
unsolved. 

There is a long discussion about the validity of this 
approach in [125] but we note that it assumes in par- 
ticular that all individuals act independently from each 
other. This is obviously not correct when we introduce 
congestion which induces correlations between individu- 
als. In such conditions, it is clear that individual choices 
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Network [ref] 


N 


gravity law form 


Results 


Railway express [337] 


13 




cr = 1.0 


Korean highways [179] 


238 




a = 2.0 


Global cargo ship [185] 


951 


Oihd-f am(—diJn) 


a = 0.59 


Commuters (worldwide) [25] 


n/a 


P^P '] eM-d^J/^^) 


(a, 7) = (0.46,0.64) for d < 300 kms 
(q,7) = (0.35,0.37) for d > 300 kms 


US commuters by county [304] 


3109 


PtPJ/c^j 


(a, 7, cr) (0.30, 0.64, 3.05) for d < 119 kms 
(a, 7, a) = (0.24, 0.14, 0.29) for d > 119 kms 


Telecommunication flow [200] 


571 


P^P.d^f 


cr = 2.0 



TABLE I List of various empirical studies on the gravity law (we essentially focused on recent and illustrative results). 



are correlated and that this entropy maximization can 
give reasonable results in the limit of small traffic only. 

A simple statistical argument 

As proposed in [274], a statistical approach could be 
helpful in the determination of important features of 
roads and traffic and more generally can give insights 
into the structure of cities. As discussed above, the ex- 
act empirical measure of Tij is extremely difficult but we 
have access to a more coarse-grained information such as 
for example the total number of miles driven in 357 
US cities^. For a given city of total area A and popu- 
lation P, it is natural to test for simple scalings such as 
one observed in other systems [26, 317]. In particular, 
the natural length scale for a city of area A is given by 
VA and we thus expect a scaling for id of the form 

-^^PP (79) 

Va 

As suggested in [274], we can explore two extreme cases 
of mobility in urban areas. If every individual is going to 
its next nearest neighbor (located at a typical distance 
l/y/p where p = P/A is the average density of the city), 
the total distance £d is given by 

id = Px ^ = VAP (80) 

implying that £d/VA scales as P^ with /3 — 1/2. 

On the other hand, if the individuals are going to ran- 
dom points in the city, the typical distance is given by 
%/ A and the total vehicle miles is given by 

£d-- Px VA (81) 

which implies (5 — 1. 

Following [274] , we plot in Fig. 37a the total number £d 
of miles driven by the whole population P of the city and 
rescaled by \/A for all the 367 US cities in the database. 
We obtain the result /? ~ 0.6 (in [274] this exponent is 



Source: http://www.census.gov 
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FIG. 37 (a) Total rescaled vehicule-miles idj^fA versus for 
357 US cities. The line is a power law fit with exponent /? = 
0.60 (the correlation coefficient is = 0.94). (b) Exponent 
/3 versus r. The red diamond corresponds to the empirical 
measure /3 « 0.6 predicting r « 1.8. 



0.66 for a slightly larger dataset) with an excellent corre- 
lation coefficient. We are thus in an intermediate situa- 
tion between the two extremes described above showing 
that the commuting pattern is probably a mixture of cen- 
tralized trips (to a central business district for example) 
and more local trips, or could also result from centralized 
patterns to many different centers and could thus be a 
signature of a polycentric organization. 

The total number of miles of built lanes was also stud- 
ied in [274] and essentially scales as \J PA which means 
that the road network is essentially a lattice of spac- 
ing 1 / in agreement with the results shown in section 
III.B.l.a. 

Using hand- waving arguments, we can explore the con- 
sequences of this empirical result /3 ~ 0.6 on the general 
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properties of the trip distribution. Since Tij is the num- 
ber of trips from a zone i to a zone j, the total distance 
traveled is given by 



id — Tij dij 



(82) 



had an exponential behavior we would have observed a 
value r > 2 but instead the simple argument presented 
here suggests the behavior 



T, 



1 



(92) 



and we can assume that total number of trips is propor- 
tional to the total number of individuals 



(83) 



(in these equations (82) and (83) some irrelevant constant 
factors are omitted) . The simplest assumption is that the 
trip distribution depends on the distance Tij = f{dij) 
and we can assume that f{x) = Cx~'^ . This is in fact a 
very general case, as the exponential case is in a statisti- 
cal sense reproduced by this form with a > 2. 

We will assume that the distance is uniformly dis- 
tributed and we consider the continuous limit which reads 



,j •'^ 

id = Y,f{d,j)d,,c^C I x^-^ 



(84) 



dx (85) 



where a is the smallest distance given by a ~ ^1 \fp ^ 
L I \fP and where the upper bound is the maximum dis- 
tance L ~ \fA. We have to separate three cases depend- 
ing on the value of r (in the following we will keep the 
dominant terms, knowing that a <ti L). 

(i) Case r < 1. In this case, the leading terms are 



id 
P 



(86) 
(87) 



leading to id ^ P\J A. 

(ii) Case 1 < r < 2. In this case, the dominant terms 
are 



id^ 
P 



(89) 



leading to ~ P ^ 2 ^J~A. 

(iii) Case r > 2. In this case, the dominant terms are 



id' 
P ■ 



Co"-- 
Co}-'' 



(90) 
(91) 



leading to id ~ y/PA. 

These results can be summarized in the Fig. 37b. In 
particular, we can see on this plot that the empirical 
result /3 = 0.6 corresponds to the value r = 1.8. If we 



with r = 1.8 slightly less than 2. This simple scaling 
argument leads to a result in agreement with the gravity 
law and signals a slow decay of the trip volume with dis- 
tance and the simultaneous existence of trips at many dif- 
ferent length scales. Aggregated and scaling arguments 
could thus help in understanding the commuting pat- 
terns, and reveal interesting features of spatial patterns 
in the city. 

Various factors could be at the origin of the different 
behaviors observed in gravity law studies (different deter- 
rent functions, different exponents, etc.). In particular, 
we can list the following factors which could play an im- 
portant role: 

• Importance of transportation modes. It is at this 
point unclear if the gravity law is the same for all 
modes. Also, the cost of a given transportation 
mode can vary in time (which was the case for air 
travel when jets appeared and is the case for cars 
with increasing oil prices) and the gravity law could 
in principle vary in time. 

• Importance of scale and discretization. As dis- 
cussed in this chapter, it is important to use a dis- 
cretization which is consistent with mobility pro- 
cesses. Also, a discussion of scale is probably im- 
portant as it can already be seen on the trip length 
distribution which seems to display peaked laws at 
the urban scale and broad distribution at national 
scales. 

• Nature of the trip. Heterogeneity of users. These 
factors are well-known in transportation research, 
and it could be necessary to distinguish different 
trips (commuting, travel, etc.) in order to get con- 
sistent gravity laws. Users are also very heteroge- 
neous and it is unclear at this stage if we need to 
distinguish between different populations with dif- 
ferent behaviors. 



D. Neural networks 

The human brain with about 10^" neurons and about 
10^** connections is one of the most complex network that 
we know of. The structure and functions of the brain 
are the subjects of numerous studies and different recent 
techniques such as electroencephalography, magnetoen- 
cephalography, functional RMI, etc., can be used in or- 
der to produce networks for the human brain (see Fig. 38 
and for a clear and nice introduction see for example [68] 
and [80]). 
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FIG. 38 Structural and functional brains can be studied with 
graph theory by following different methods shown step-by- 
step in this figure. From [68]. 



logarithmic behavior would predict a ratio 

_ {e){N = 4000) log(4000) _ 
^ ~ {e){N = 1000) ^ log(lOOO) 



1.20 



(93) 



while a 3-dimensional spatial behavior would give a ratio 
of order r 4^^^ w 1.6 which is closer to the observed 
value. Larger sets would however be needed in order to 
be sure about the behavior of this network concerning the 
average shortest path and to distinguish a log N from a 
iV^/^ behavior expected for a three-dimensional lattice. 

Things are however more complex than it seems 
and even if functional connectivity correlates well with 
anatomical connectivity at an aggregate level, a recent 
study [162] shows that strong functional connections ex- 
ist between regions with no direct structural connections, 
demonstrating that structural and functional properties 
of neural networks are entangled in a complex way and 
that future studies are needed in order to understand this 
extremely complex system. 



E. Summary: Existence of general features 



Brain regions that are spatially close have a larger 
probability of being connected than remote regions as 
longer axons are more costly in terms of material and 
energy [68]. Wiring costs depending on distance is thus 
certainly an important aspect of brain networks and we 
can expect spatial networks to be relevant in this rapidly 
evolving topic. So far, many measures seem to confirm 
a large value of the clustering coefficient, and a small- 
world behavior with a small average shortest-path length 
[120, 182]. It also seems that neural networks do not 
optimize the total wiring length but rather the process- 
ing paths thanks to shortcuts [183]. This small world 
structure of neural networks could reflect a balance be- 
tween local processing and global integration with rapid 
synchronization, information transfer, and resilience to 
damage [204]. 

In contrast, the nature of the degree distribution is still 
under debate and a recent study on the macaque brain 
[232] showed that the distribution is better fitted by an 
exponential rather than a broad distribution. Besides the 
degree distribution, most of the observed features were 
confirmed in latest studies such as [331] where Zalesky 
et al. propose to construct the network with MRI tech- 
niques where the nodes are distinct grey-matter regions 
and links represent the white-matter fiber bundles. The 
spatial resolution is of course crucial here and the largest 
network obtained here is of size N ^ 4, 000. These au- 
thors find a large clustering coefficients with a ratio to 
the corresponding random graph value of order 10^ (for 
iV « 4, 000). Results for the average shortest path length 
{£) are however not so clear due to relatively low values 
of iV. Indeed, for N varying from 1, 000 to 4, 000, {£) 
varies by a factor of order 1.7 — 1.8 [331]. A small-world 



Through the different examples presented in this chap- 
ter, we see that some general features emerge. Space 
implies a cost associated to distance and the formation 
of long links must be justified for a 'good economical' 
reason, such as a large degree for example. This simple 
observation leads to various effects that we summarize 
here. 

In the table II, we summarize the main empirical fea- 
tures discussed in this chapter. 

• Network typology 

We can roughly divide spatial networks in two cat- 
egories. The first one consists in planar networks. 
These networks possess many features similar to 
lattices but in some cases (such as the road net- 
work) display very distinctive features which call 
for new models. The other category consists in 
spatial, non-planar networks such as the airline net- 
work, the cargo ship network, or the Internet where 
nodes are located in spaces, edges have a cost re- 
lated to their length, but where we can have in- 
tersecting links. We note here that in some cases 
such as for the Internet, the point distribution is 
not always uniform and could play an important 
role. 

• Effect of space on P{k) 

Spatial constraints restrict the appearance of large 
degrees and P{k) is usually peaked. Constraints 
are stronger for planar networks and for non-planar 
spatial networks such as airlines the degree distri- 
bution can be broad. 

• Effect of space on the link distribution 

Here also the spatial constraints limit severely the 
length of links. For planar networks such as roads 
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TABLE II List of different networks with their main characteristics; number of nodes ; average degree (fc) ; average clustering 
coefficient (C); average shortest path {£); the nature of the degree and weight distributions P{k), P{w); the scaling of the 
strength with the degree s{k) ~ k^; and the relation between the centrality and the degree when it exists g{k) ~ fc*^. 'PL(a;)' 
means that a power law fit was measured and x is the value of the exponent. * existence of many anomalies and ** few 
anomalies. For road networks M denotes the meshedness (denoted by 7 in section II.B.2.b). 



and streets, the distribution is peaked while in 
other networks such as the Internet or the airline 
network, the distribution can be broader. 

• Effect of space embedding on clustering and assor- 
tativity 

Spatial constraints implies that the tendency to 
connect to hubs is limited by the need to use small- 
range links which explains the almost flat behav- 
ior observed for the assortativity. Connection costs 



also favor the formation of cliques between spatially 
close nodes and thus increase the clustering coefh- 
cient. 

• Effect of space on the average shortest path 
The average shortest path for 2d planar networks 
scales as N^^'^ as in a regular lattice. When enough 
shortcuts (such as in the Watts-Strogatz model) are 
present, this a/IV behavior is modified and becomes 
logarithmic. In some three-dimensional situations 
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such as in the brain for example, it is however difR- 
cult in some experiments to distinguish between a 
log N from a N^^"^ behavior. 

• Effect of spatial embedding on topology-traffic cor- 
relations 

Spatial constraints induce strong non-linear cor- 
relations between topology, traffic, and distance. 
The reason for this behavior is that spatial con- 
straints favor the formation of regional hubs and 
reinforces locally the preferential attachment, lead- 
ing for a given degree to a larger strength than 
the one observed without spatial constraints. Es- 
sentially long-range links can connect to hubs only, 
which yields a value > 1. The existence of con- 
straints such as spatial distance selection induces 
some strong correlations between topology (degree) 
and non-topological quantities such as weights or 
distances. 

• Effect of space embedding on centrality 

Spatial constraints also induce large betweenness 
centrality fluctuations. While hubs are usually very 
central, when space is important central nodes tend 
to get closer to the gravity center of all points. 
Correlations between spatial position and central- 
ity compete with the usual correlations between de- 
gree and centrality, leading to large fluctuations of 
centrality at fixed degree. 

• Structure of mobility networks 

The different discussions and empirical examples 
considered in this chapter showed that the average 
trip duration and length distribution actually de- 
pend on many factors such as the scale considered 
(urban, inter-urban, national, or global), the popu- 
lation, congestion, or on the transportation mode, 
etc. Also, it seems that at the urban scale the trip 
duration and length distributions are peaked law 
and that at a larger scale the trip length is dis- 
tributed according to a broad law with exponent of 
order 1.6. The identification of relevant factors and 
the constitution of a clear typology is however still 
an open problem and we can expect in the future 
a wealth of new results in this area. 



Although many results and measures are missing due 
to the lack of data availability, we see that the main ex- 
pected effects of spatial constraints are present for most 
of these networks. Also, traffic properties seem to be rel- 
atively ubiquitous, with a broad traffic distribution, and 
a non-linear strength-degree law with exponent j3 dis- 
tributed around 1.5. In the next chapter, we will discuss 
the various models which attempt to reproduce these dif- 
ferent effects. 



IV. MODELS OF SPATIAL NETWORKS 

In this chapter we will describe the most important 
models of spatial networks. We basically divide these 
models into five large classes (which of course could 
present non empty overlaps): 

• The first class describes geometric graphs which are 
probably the simplest models of spatial networks. 
They are obtained for a set of vertices located in the 
plane and for a set of edges which are constructed 
according to some geometric condition. There is 
a large body of literature on this rather general 
subject and we will here focus on a small set of 
these studies that concern more precisely graph 
constructed with an Euclidean proximity condition. 

• The second class concerns the Erdos-Renyi model 
and its spatial generalization, including the spatial 
hidden variables models. These networks are ob- 
tained when the probability to connect two nodes 
depends on the distance between these nodes. An 
important example of this class is the Waxman 
model [314] for the Internet structure. 

• The third class comprises spatial variants of the 
Watts-Strogatz model [313] and could be coined as 
spatial small-worlds. In these cases, the starting 
point is a d-dimensional lattice and random links 
are added according to a given probability distri- 
bution for their length. 

• The fourth class concerns spatial growth models 
which can be considered as spatial extensions of 
the original growth model proposed by Barabasi 
and Albert [8]. 

• Finally, the last class concerns optimal networks 
obtained by the minimization of a 'cost' function. 
These networks were considered already a long time 
ago in different fields (mathematics, transportation 
science, computer science) and are now back with 
the explosion of studies on complex networks. 

A. Geometric graphs 

A random geometric graph is obtained when the points 
located in the plane are connected according to a given 
geometric rule. The simplest rule is a proximity rule 
which states that nodes only within a certain distance are 
connected. There is an extensive mathematical literature 
on geometric graph and the random case was studied by 
physicists in the context of continuum percolation (see 
for example [24, 261]). 

1. The simplest random geometric graph 

In the framework of complex networks, one of the first 
studies on random geometric graphs is provided in [104] 
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FIG. 39 Example of a 2d random geometric graph obtained 
for N = 500 spheres and (fc) = 5. From [104]. 



where the nodes are small spheres of radius r and two 
nodes are connected by an edge if they are separated by 
a distance less than 2r (which is the condition for the in- 
tersection of spheres of radius r). This network is related 
to the unit disk graph defined in geometric graph theory, 
as the intersection graph of a family of unit circles in 
the Euclidean plane: if we draw a vertex at the center of 
each circle, we connect two vertices if the corresponding 
circles intersect. This unit disk graph has been used in 
computer science to model the topology of ad-hoc wire- 
less communication networks where the nodes are wire- 
lessly connected to each other, without a base station 
[168]. The nodes are located in a two-dimensional plane 
and the area within which a signal can be transmitted to 
another node is described as a circle (of the same radius 
if all nodes have the same power to transmit). These ran- 
dom geometric graphs have also been used in continuum 
percolation models [24, 261]. 

If the volume of the total system is 1 then the proba- 
bility p that two randomly chosen vertices are connected 
is equal to the volume of the sphere of radius R — 2r 
which reads in d dimensions 



P = V{R) = 



r(i + d/2) 



(94) 



In the context of continuum percolation this quantity is 
the excluded volume Ve = p. The average degree is then 
given by (fc) = Np and we can then express i? as a func- 
tion of (fc) 



R = 



1 



{k)^(d + 2 
N \ 2 



l/d 



(95) 



which shows that for a given average degree (fc) the nodes 
(spheres) have to become smaller when more nodes are 
added. 

Similarly to the usual ER random graph, there is a 



critical average degree above which there is a non empty 
giant component. The authors of [104] computed numer- 
ically this critical value (fc)c for different dimensions and 
proposed the scaling (fc)c = 1-1- bd~^ with h — 11.78(5) 
and 7 — 1.74(2). This relation also states that in infi- 
nite dimension the random geometric graph behaves like 
a ER graph with (fc)c = 1. 

In [156], the authors compute analytically the degree 
distribution for these random geometric graphs. If we 
assume that the points are distributed according to a dis- 
tribution p[x) and the condition for connecting to nodes 
i and j located at positions and Xj, respectively, is 
^Eihi) < -R, we can then estimate the degree distribu- 
tion. If we denote by Bb.{x) the ball of radius R and 
centered at x, the probability qr^x) that a given node is 
located in Bpi{x) is 



(1r{x) = / dx'p{x') 

JBr(x) 



(96) 



The degree distribution for a node located at x is thus 
given by the binomial distribution 



Pik;x,R)= ^ ^)to(x)^[l-te(x)] 



N-l-k 



(97) 



In the limit — > oo and i? — > 0, the degree distribution 
for a node located at x is Poissonian and reads 

P(k; X, a) = -^aVa;)'=e-"P(^) (98) 
fc! 

where a ~ (k)/ J dxp^{x) fixes the scale of the average 
degree. For example this expression gives for a uniform 
density p{x) — po a degree distribution of the form 



P(fc) 



(apo)^ 
klk'i 



(99) 



which decays very rapidly with fc. In contrast if the den- 
sity decays slowly from a point as p{r) ~ r^^ we then 
obtain P{k) ^ fc^'*/^ showing that large density fluctua- 
tions can lead to spatial scale-free networks [156]. 

The average clustering coefficient can also be calcu- 
lated analytically and the argument [104] is the follow- 
ing. If two vertices i and j are connected to a vertex fc 
it means that they are both in the excluded volume of 
fc. Now, these vertices i and j are connected only if j is 
in the excluded volume of i. Putting all pieces together, 
the probability to have two connected neighbors (ij) of 
a node fc is given by the fraction of the excluded volume 
of i which lies within the excluded volume of fc. By av- 
eraging over all points i in the excluded volume of fc we 
then obtain the average clustering coefficient. We thus 
have to compute the volume overlap pd of two spheres 
which for spherical symmetry reasons depends only on 
the distance between the two spheres. In terms of this 
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function, the clustering coefficient is given by 

{Cd) = ^ / Pd{r)dV (100) 

For d = 1, we have 

pi{r)^{2R-r)/2R=l-r/2R (101) 
and we obtain 

(Ci) = 3/4 (102) 

For d = 2, we have to determine the area overlapping in 
the Fig. 40 which gives 

P2{r) = {9{r)-sm{e{r)))/Tr (103) 

with 9{r) — 2 arccos(r/2_R) and leads to 

(C2) 1 - 3%/3/47r w 0.58650 (104) 

Similarly an expression can be derived in d dimension 




FIG. 40 The overlap between the two disks (area comprised 
within the bold line) gives the quantity piir). From [104]. 



[104] which for large d reduces to 



{Co) 



(105) 



a. Application to ad-hoc networks: Calculation of the giant 
component. In ad-hoc networks [242], users communi- 
cate by means of a short range radio device. This means 
that a device can communicate with another one if their 
distance is less than a certain distance which corresponds 
to the transmission range of the device. The set of con- 
nected devices can be used to propagate information on 
longer distance by going from the source to the destina- 
tion hop by hop through intermediate nodes. If there is a 
large density of nodes, alternate routes are even available 
which allows to split the information into separate flows. 
Usually, the users are mobile and the network evolves in 
time and it is important to understand the condition for 
the existence of a giant cluster. 

In [242], Nemeth and Vattay explicitly compute the 
condition for the appearance of a giant component using 
generating functions such as in [248]. In particular, it is 
not difficult to show that the fraction S of nodes which 
are in the giant component is given by 

S = 1- Go{u) (106) 

where the generating function Go (a;) is given by 

Go{x) = Y,Pik)x'' (107) 



where P{k) is the probability that a node has k neigh- 
bors. The quantity u is the probability of not belonging 
to the giant component and satisfies 



u — Gi{u) 



(108) 



where Gi{x) = G'q{x) / G'q{1) (see for example [32]). 

In the ad-hoc network case, the typical internode dis- 
tance To depends on the density p — N/A as 



1 



(109) 



The range of the radio devices is denoted by rr and the 
important dimensionless parameter is here ^/rr/rQ. The 
probability to have k neighbors is a binomial law of pa- 
rameter pirr^ = rj and in the limit of large N (with Nrj 
constant) we thus obtain 



The average clustering coefficient thus decreases from 
the value 3/4 for d = 1 to values of order 10"^ for d of 
order 10 and is independent from the number of nodes 
which is in sharp contrast with ER graphs for which 
(C) ~ 1/A'^. Random geometric graphs are thus much 
more clustered than random ER graphs. The main rea- 
son - which is in fact valid for most spatial graphs ~ is 
that long links are prohibited or rare. This fact implies 
that if both i and j are connected to k it means that there 
are in some spatial neighborhood of k which increases the 
probability that their inter-distance is small too, leading 
to a large (C). 



P(fc) = e-"|^ 



(110) 



This particular form of P{k) allows to compute Gq{x) 
and Gi {x) and we eventually obtain an implicit equation 
for S 



Or, after inversion 



V = 



-logons) 

S 



(111) 



(112) 



This last relation gives the minimum transmission rate 
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in order to get a giant component of size NS. 

Also, in this model one can introduce a probability p{r) 
that the devices are connected at distance r and different 
quantities such as the average clustering coefficient can 
be computed (see [242] for details). 



to the average degree at distance r (for large R) 
{k){r) « -iVe-''/2 



The average degree of the network is then 



(116) 



b. Model of mobile agents for contact networks. A model 
for contact networks based on mobile agents was pro- 
posed in [141-143]. In this model, individuals are de- 
scribed by disks with the same radius and are moving 
and colliding in a two-dimensional space. The contact 
network is built by keeping track of the collisions (a link 
connects two nodes if they have collided) . A collision 
takes place whenever two agents are at a certain distance 
and this model can thus be seen as a dynamical version of 
random geometric graphs. In particular, Gonzalez, Lind, 
Herrmann [142] used this framework to model the sex- 
ual interaction network and interestingly enough, found 
in some cases and for a simple collision rule such that 
the velocity grows with the number of previous collisions 
that a scale-free network with exponent 7 = 3 emerges. 



2. Random geometric graph in iiyperbolic space 

Motivated by studies on the Internet, a model of ran- 
dom geometric graph in hyperbolic space was proposed 
recently (see [284] or the small review article [201]). In 
their studies, Boguha, Krioukov and Serrano considered 
the two-dimensional hyperbolic space of constant neg- 
ative curvature equal to K — ~(^^ — —1 and use a polar 
representation (r, 9) for the nodes. They placed N points 
distributed uniformly in a disk of radius R and in the Eu- 
clidean disk projection this implies that the nodes have 
a uniform angle distribution U{9) = l/27r for 6 e [0, 27r] 
and that the radial coordinate is distributed according to 



p{r) 



sinh) 



cosh i? — 1 



(113) 



They used then the usual geometric graph rule and con- 
nect two nodes if their hyperbolic distance is less than R 
which can be written in terms of the connection proba- 
bility as 



pc{x) =e{R-x) 



(114) 



where Q is the Heaviside function. The hyperbolic dis- 
tance d between two nodes {r,6) and {r',d') is defined 
by 

cosh (d = cosh cosh (r' — sinh (^r sinh cos (6*' — 6) 

(115) 

The expected degree of a node at point X located at 
distance r from the origin O is proportional to the area 
of the intersection of the two disks of radius R centered at 
O and X, respectively. The disk centered at O contains 
all nodes and the intersection can be calculated leading 



(fc) = / p{r){k){r)dr^-Ne'^/^ 



(117) 



The degree distribution is [201] 



P{k) = 2 



ik)Ynk-2Ak)/2)^^_, (118) 



and behaves as a power law with an exponent equal to 
three, as in the BA model. This network heterogeneity 
however follows directly from the properties of the hyper- 
bolic geometry and not from a preferential attachment 
mechanism. 

Krioukov ct al. also considered the case where the node 
density is not uniform 

sinhar 

= " coshai?-l " ' ^'''^ 

In this case, it can be shown that the degree distribution 
is also a power law of the form 



where 



P{k) - k- 



2a/C for f > 1/2 
for f < 1/2 



(120) 



(121) 



The power law exponent depends then both on the hy- 
perbolicity C ^nd a, through the ratio a/^, a result ex- 
pected from an analogy with trees (see [201]). We thus 
obtain naturally scale-free networks as random geometric 
graphs on hyperbolic spaces and conversely, it can also be 
shown that scale-free networks can result from a hidden 
hyperbolic metric [201]. 

In line with random geometric graphs in euclidean 
space, there is also a strong clustering for these graphs 
constructed on hyperbolic space (unfortunately the clus- 
tering coefhcient cannot be computed analytically). 

Finally, the authors of [201] extended their model and 
introduce an inverse 'temperature' /3 akin to usual sta- 
tistical mechanics in the connection probability 



Pc{x) 



1 + e 



(122) 



and showed that this model reproduces well the In- 
ternet measurements for P{k), the assortativity knn{k)^ 
and the clustering coefficient C{k) for a = 0.55, C = 1 
and B = 2. 
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A = 2.5 A = 5 




FIG. 41 Spatial structure of the network obtained by the 
method proposed in [271] for different values of the exponent 
A of degree distribution. In (a) the networks are shown and 
in (b) the corresponding chemical shells of equidistant sites 
form the central node. From [271]. 



3. A scale-free network on a lattice 

As we saw in the section IV. A. 1 geometric graphs con- 
structed on uniformly distributed points naturally lead to 
networks with degrees distributed according to a Poisson 
distribution. It is however interesting to generate spa- 
tial graphs with a broad distribution (scale-free network) 
in order to understand the effect of strong heterogeneity 
on spatial networks. In [271], Rozenfeld et al. proposed 
a simple method to construct a scale-free network on a 
lattice (and in [310] another variant is suggested). This 
model is defined on a d-dimensional lattice of linear size 
R and with periodic boundary conditions. A random de- 
gree k is assigned to each node on this lattice according 
to the probability distribution 

P{k) = Ck-^ (123) 

The idea is then simple: we connect a randomly chosen 
node i to all its neighbors until its degree reaches its 
assigned value fcj. The larger k and the larger the region 
contained connected neighbors. The size of this region is 
then given by 

r(fc) = Ak^/'' (124) 

where the prefactor A essentially depends on the density 
of nodes. We show in Fig. 41 the obtained networks for 
two different values of A. The larger A and the shorter 
the links (large k and therefore long links are very rare) 
and the closer we are to a regular lattice. We can define 
a chemical shell £ as consisting of all sites at minimal 



distance £ from a given site. For large A these chemical 
shells are concentric (as in the case of a regular lattice) 
while for smaller A the presence of long links destroys 
this order. Scaling arguments proposed in [271] suggest 
that the minimal length exponent is given by 

A — 2 

" A - 1 - 1/d ^^^^^ 

This network has then the curious property to have a 
fractal dimension which stays identical to the euclidean 
dimension, but with a minimal length exponent c?„jj„ < 1 
for all A and d > 1. 

The authors of [310] studied the percolation properties 
of such a model and found that for these spatial scale- 
free networks with a degree distribution P{k) ~ k~^, 
the percolation threshold in the limit of infinite networks 
does not go towards zero (for A > 2), in sharp contrast 
with non-spatial scale- free network which have pc = for 
A < 3 (and N = oo). 

In a non-spatial scale-free network, there are many 
short paths between the different hubs of the system eas- 
ing the percolation. In contrast, for spatial (scale-free) 
networks there is a high local clustering due to the lim- 
ited range of links which naturally lengthen the distance 
between hubs. This negative assortativity makes it thus 
more difficult to achieve percolation in such a system 
hence the existence of a non-zero threshold. As noted in 
[310], the spread of a disease too would be easier to con- 
trol than on a scale-free network (see section V.F) which 
is expected as spatial containment is usually easier to set 
up. 



4. Apollonian networks 

Other models were proposed in order to obtain a spa- 
tial scale-free network. In particular, Apollonian net- 
works were introduced by Andrade et al. [178] where they 
constructed a scale-free network (Fig. 42) from a space- 
filling packing of spheres and by connecting the centers of 
touching spheres by lines. These networks are simultane- 
ously planar, scale-free (with exponent 7 = 1-1- In 3/ In 2, 
small- world- in fact 'ultra-small' with an average shortest 
path varying as 

(^) - (log TV) (126) 

and a clustering coefficient larger than 0.8 for large N. 
Various quantities for the Apollonian network and one of 
its variant are also computed and discussed in [333, 334]. 
Due to all these simultaneous properties, Apollonian net- 
works provide an interesting playground to test theoret- 
ical ideas. 
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FIG. 42 Top: Classical Apollonian packing. Bottom: Apol- 
lonian network (showing the first, second and third generation 
with circles, squares, dots symbols respectively). From [178]. 



B. Spatial generalization of the Erdos-Renyi graph 

1. Erdos-Renyi graph 

The Erdos-Renyi (ER) graph [122-124] is the paradigm 
for random graphs and is in many cases used as a null 
model. One simple way to generate it is to run through 
all pairs of nodes and to connect them with a probability 
p. The average number of links is then 



{E)=p 



N{N ~ 1) 



(127) 



giving an average degree equal to (fc) = 2{E) /N = p{N — 
1). This last expression implies that in order to obtain 
a sparse network, we have to choose a small p scaling as 
p = (fc) /N for large N. 

The establishment of edges are random, independent 
events and we thus obtain a binomial distribution for the 
degree /c of a node 



P{k) = 



N - 1 
k 



vHi-p) 



N-k+l 



(128) 



which can be approximated by a Poisson distribution 



p{k) 



k\ 



(129) 



for large N with pN — (fc) constant. Other classical 
results can be easily derived such as the clustering coef- 
ficient which can be shown to be C = p and the average 
shortest path (£) ~ log A^/ log(fc). In fact for any gen- 
eralized uncorrelated random graph characterized by a 



probability distribution P{k) it can be shown (see for 
example [32]) that the average clustering coefficient is 



(C) 



1 {{P)~{k)f 



N 



and that the average shortest path 



lo_g N/{k) 
log 



(fc) 



(130) 



(131) 



These last two well-known expressions are useful in the 
sense that they provide a reference to which we can com- 
pare results obtained on a specific network in order to 
understand its features. In particular, we can expect 
very different behavior for spatial networks signalled by 
different scaling of these quantities with N. 



2. Planar Erdos-Renyi graph 

The first simple idea to generate a random planar 
graph would be to first generate a set of points in the 
two-dimensional space and then to construct an Erdos- 
Renyi graph by connecting randomly the pairs of nodes. 
It is clear that in this way we will generate mostly non- 
planar graphs but we could decide to keep links which 
preserve planarity. We would then obtain something like 
represented in Fig. 43 (top). It is easy to imagine even 




FIG. 43 Top: Planar Erdos-Renyi network obtained by re- 
jecting links if they destroy planarity (A'^ — 1000). Bottom: 
Planar Barabasi-Albert network obtained by the same rejec- 
tion method. 

other extensions of this process and we could construct 
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a planar BA network by adding in the two-dimensional 
space nodes (with a random, uniformly distributed lo- 
cation) which will connect according to the preferential 
attachment. We would then keep the node if it preserves 
the planarity of the system. We note here that there are 
some visual similarities with networks obtained by ran- 
dom sequential adsorption of line segments [336] and it 
would be interesting to understand if there are deeper 
connections between these problems. 

Instead, mathematicians studied closely related net- 
works. If we denote by Pn the class of all simple labelled 
planar graphs on N vertices ^. We can draw a random 
planar graph Rj^ from this class with a uniform probabil- 
ity [108] and ask for some questions [137, 227] such as the 
number of vertices of a given degree, the number of faces 
of a given size, etc. For instance, the following results 
have been demonstrated ([108, 137, 227] and references 
therein) 

• The random planar graph i?jv is connected with 
probability at least 1/e. 

• The number Mu of unlabelled planar graphs scales 



N 



(132) 



with 9.48 < 7„ < 32.2 and the number TV; of la- 
belled planar graphs as 

M~7,(7V!)i/^ (133) 

where 27.22685 < 7^ < 27.22688. 

• The average number of edges of Rn is (E) > ^-N. 

• The degree distribution decreases at least as 
N/jl{k + 2)1. 

Other properties can be derived for this class of networks 
and we refer the interested reader to [108, 137, 227] and 
references therein for more results. 

3. The hidden variable model for spatial networks 

In the Erdos-Renyi model, the probability p to con- 
nect two nodes is a constant. In certain situations, we 
could imagine that a node is described by a number of 
attributes (called hidden variables or fitnesses) and the 
connection between two nodes could depend on the re- 
spective attributes of these nodes [55, 71]. In order to 
give a concrete example, we assume that there is only one 
attribute i] which is a real number distributed according 
to a function p{i]). The probability of connection for a 



^ A labeled graph refers to a graph where distinct labels are as- 
signed to all vertices. The labeling thus adds configurations in 
the counting process of these graphs. 



pair of nodes (i, j) is then given by pij = f(jii,rij) where 
/ is a given function. In the case / = const, we recover 
the ER random graph. The average degree of a node 
with fitness 77 is given by 

/•OO 

Hv) = N / fiv, v')p{v')dv' = NF{7j) (134) 
Jo 

and the degree distribution is then 



P(fc) ^ / p{rj)S{k 



P 



Hv)) 
±1 

dk 



(135) 



A surprising result appears if we chose for example an 
exponential fitness distribution {p{rii) ~ e~'') and for the 
function / a threshold function of the form 



f{'q„iy)=e[r^,+T^,j-z{N)] 



(136) 



where is the Heaviside function and z{N) a threshold 
which depends in general on N . In this case, Caldarelli 
et al. [71] found a power law of the form P{k) ^ k^^ 
showing that a scale-free network can emerge even for 
a peaked distribution of fitnesses. A spatial variant of 
this model was proposed in [225] (and discussed together 
with other models in [154]) where the nodes i and j are 
connected if the following condition is met 



{m + Vj)h[dE[i,i)] > (j) 



(137) 



where h[r\ is a decreasing function and a constant 
threshold. For this model, large fitnesses can therefore 
compensate for larger distances and we will observe large- 
fitnesses nodes connected by long links. If the distribu- 
tion of fitnesses /(?/) has a finite support or is strongly 
peaked around some value, we will have a typical scale 
ro = h~^{(j>/2rj) above which no (or a very few) con- 
nections are possible. As a result, the average shortest 
path will behave as for a lattice with {£) ^ N^/''- in a 
d-dimensional space. For an exponential fitness distribu- 
tion 7(77) = e^^^ and h{r) — r^^, the authors of [225] 
find various degree distributions according to the value 
of (3 and goes to a power-law p{k) ~ fc~^ for /3 — 
and to an exponential distribution for /3 — d. Various 
other cases were also studied in [71, 225] and help to un- 
derstand within this model when scale-free distribution 
could appear. 

Finally, we mention here a generalization to other met- 
rics than the spatial distance [56, 319]. In particular, in 
[56], the probability that two individuals are connected 
decreases with a particular distance between these in- 
dividuals. This distance is computed in a 'social' space 
and measures the similarity for difi^erent social attributes. 
This model is able to reproduce some of the important 
features measured in social network such as a large clus- 
tering, positive degree correlations and the existence of 
dense communities. More recently, Serrano et al. [284] 
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developed the idea of hidden metric space by using the 
one-dimensional circle as an underlying metric space in 
which nodes are uniformly distributed. A degree k drawn 
from a law P{k) ~ k~'^ for each node and each pair of 
nodes is connected with a probability r{d] k, k') that de- 
pends on the distance d between the nodes and also on 
their respective degrees k and k' . In particular, they 
studied the following form 



r{d;k,k') 



dcik,k') 



(138) 



where a > 1 and where dc kk' for example. The prob- 
ability that a pair of nodes is connected decreases then 
with distance (as d"") and increases with the product 
of their degrees kk' . In this case a long distance can be 
compensated by large degrees, as it is observed in var- 
ious real-world networks. In this model, in agreement 
with other models of spatial networks, we observe a large 
clustering (for a large enough). 



4. The Waxman model 

The Waxman model [314] is a random network topol- 
ogy generator introduced by Waxman and appears as 
a spatial variant of the ER model. In this model, the 
nodes are uniformly distributed in the plane and edges 
are added with probabilities that depend on the distance 
between the nodes 



(139) 



The quantity do determines the typical length of the links 
and P controls the total density of links. In terms of 
hidden variables the attribute is here the spatial location 
of the node and the pair connection probability depends 
on the distance between the nodes. For do ^ oo, length 
is irrelevant and we recover the ER model while for do ~ 
(where p is the average density of nodes in the 
plane) long links are prohibited and we are in the limit 
of a lattice-like graph. 

Even if this model is very simple, it served as a first 
step towards the elaboration of more sophisticated mod- 
els of the Internet [332] . Also, despite its simplicity, this 
Waxman model can be used in order to understand the 
importance of space in different processes taking place on 
this network. We can cite for example navigation or con- 
gestion problems in communication systems (see section 
V.D). 

A growth model close to the Waxman model was pro- 
posed in [181] where at each time step a new node u is 
added in the plane and is connected to existing nodes 
V with a probability (if it fails to connect the node is 
discarded) 

P(u,w) = ;9e-"''^("'") (140) 
or even decreasing as power law P{u,v) ^ dE{u,v)'^'^ . 



Networks generated with this algorithm have a large clus- 
tering coefficient (as expected) and probably a large di- 
ameter (although there is no quantitative prediction in 
[181]). 



C. Spatial small worlds 

1. The Watts-Strogatz model 

Already in 1977 spatial aspects of the small-world 
problem were considered by geographers in the paper 
[293] but we had to wait until 1998 when Watts and 
Strogatz (WS) proposed a simple and powerful network 
model [313] which incorporates both a spatial component 
and long-range links. This model is obtained by starting 
from a regular lattice and by rewiring at random the links 
with a probability p (Fig. 44) . 






FIG. 44 Construction of the Watts-Strogatz model for TV = 8 
nodes. At p = each node is connected to its four nearest 
neighbors and by increasing p an increasing number of edges 
is rewired. Adapted from Watts and Strogatz [313]. 

The degree distribution of this network has essentially 
the same features as the ER random graph, but the clus- 
tering coefficient and the average shortest path depend 
crucially on the amount of randomness p. The average 
clustering coefficient has been shown to behave as [28] 



{C{p)) 



3(to- 1) 
2(2m - 1) 



(141) 



where the average degree is (fc) = 2m. The average short- 
est path has been shown to scale as [36, 37] 



N 



where the scaling function behaves as 



:f(x) 



X for a; <C 1 
In X for X 3> 1 



(142) 



(143) 



The crossover size scales as N* ^ l/p [28, 36, 37] which 
basically means that the crossover from a large-world to 
a small-world occurs for an average number of shortcuts 
equal to one 

N*p - 1 (144) 
The network can thus be seen as clusters of typical size 
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N* (p) connected by shortcuts. 

The interest of these networks is that they can simuha- 
neously present some features typical of random graphs 
(with a small- world behavior £ ~ log N) and of clustered 
lattices with a large average clustering coefficient (while 
for ER random graph (C) ^ 1/^)- 



shortcuts p*N = p*{L)L'^ then satisfies 

P,{L)p*{L)L'' (147) 

which means that if we have a fraction p > p* of long 
shortcuts, the system will behave as a small-world. We 
then obtain 



2. Spatial generalizations 

One of the first variants of the Watt-Strogatz model 
was proposed in [173, 195, 280] and was subsequently 
generalized to higher dimensions d [282]. In this variant 



A 



^ C 



L-<^ if a < 1 

l^-d-l if Q, > 1 



(148) 



and a logarithmic behavior log L/L'^ for a — \. For a 
given value of p we thus have one length scale 



Ip^i/'' if a < 1 

,pl/(a-<i-l) if > 1 



(149) 
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FIG. 45 Schematic representation of spatial small-world 
in (a) one dimension and (b) two dimensions. The dashed 
lines represent the long-range links occuring with probability 
q{£) ~ £-". From [282]. 

(see Fig. 45), the nodes are located on a regular lattice in 
d-dimensions with periodic boundaries. The main idea is 
that if the shortcuts have to be physically realized there 
is a cost associated with their length. A way to model 
this is to add links with a probability 



(145) 



For each node, we add a shortcut with probability p 
which implies that on average there will be pN additional 
shortcuts. 

Concerning the average shortest path, it is clear that 
if a is large enough, the shortcuts will be small and the 
behavior of {£) wih be 'spatial' with {£) ~ N^^'^. On 
the other hand, if a is small enough we can expect a 
small-world behavior £ ^ logiV. In fact, various studies 
[173, 239, 282] discussed the existence of a threshold etc 
separating the two regimes, small- and large-world. We 
follow here the discussion of [259] who studied carefully 
the behavior of the average shortest path. The probabil- 
ity that a shortcut is 'long' is given by 



rL/2 

Pc{L) = / q{£)d£ 

J(l-c)L/2 



(146) 



which in the special case a = was obtained in [247] . We 
will then have the following scaling form for the average 
shortest path 



where the scaling function varies as 



Ta{x) 



X if a: ^ 1 
In a; if a; 3> 1 



(150) 



(151) 



(or even a function of the form (Ina;)"''"-' with a{a.) > 
for X ^ 1). The characteristic length for a > 1 thus 
scales as 



(152) 



and displays a threshold value ac — d+1 a, value already 
obtained with the average clustering coefficient for d ^ 1 
in [282]. For a > ac, the length scale L*{p) is essentially 
finite and less than 1, which means that for all values of 
L the system has a large-world behavior with {£) ^ L. In 
other words, the links in this case cannot be long enough 
and the graph can always be coarse-grained to reproduce 
a regular lattice. In the opposite case a < ac, the length 
L* (p) diverges for p — and there will always be a regime 
such that L* {p) 3> L implying a small- world logarithmic 
behavior. 

Finally, we mention a recent numerical study [197] of 
this model which seems to show that for a > d there are 
two regimes. First, ior d < a < 2d 



{£) - (logN)"^"'' 



with 



a = 




for d = 2 



(153) 



(154) 



where c is small but non-zero. The critical fraction of 



The second regime is obtained for a > 2d where the 'spa- 
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tial' regime (£) ~ N^/''- is recovered. We note that nu- 
merically, the scaling prediction of [259] with two regimes 
only and the result of [197] are however difficult to dis- 
tinguish. For d = 1 there are no discrepancies {a — \ for 
a — d— 1) and for d = 2, the results for a = 2 and a — A 
are consistent with the analysis of [259] . A problem thus 
subsists here for d = 2 and a = 3 for which a = 4/3, a 
value probably difficult to distinguish numerically from 
corrections obtained aX a = Uc — d + \. 



erentially connect to hubs, unless the hub is too far in 
which case it could be better to connect to a less con- 
nected node but closer in space. In order to have long 
links, the target node must have a large degree in order to 
compensate for a small F{d) such that kF{d) ~ 1. This 
is for instance the case for airlines: Short connections go 
to small airports while long connections point preferably 
to big airports, ie. well-connected nodes. 



D. Spatial growth models 

1. Generalities 

In this part we will review models of growing networks 
which essentially elaborate on the preferential attach- 
ment model proposed by Albert and Barabasi [8, 287]. 
In the preferential attachment, there is a propensity to 
connect a new node to an already well-connected one 
[8, 287] which is probably an important ingredient in the 
formation of various real- world networks. 

The process to generate such a Barabasi- Albert (BA) 
network is thus extremely simple. Starting from a small 
'seed' network, we introduce a new node n at each time 
step. This new node is allowed to make ni connections 
towards nodes i with a probability 



cx kj 



(155) 



where ki is the degree of node i. We refer the interested 
reader to the various books and reviews which describe 
in detail this model [7, 70, 87, 113, 244, 255] . In particu- 
lar, the degree distribution behaves as a power-law with 
exponent P{k) ^ k^'^ with 7 = 3, the average shortest 
path behaves at the dominant order as logiV, and the 
average clustering coefficient is given by 



(C) 



8N 



(In NY' 



(156) 



while C{k) - 1/k. 

In many networks such as transportation or communi- 
cation networks, distance is however a relevant parameter 
and real-world examples suggest that when long-range 
links are existing, they usually connect to hubs-the well- 
connected nodes. Many variants of the BA model were 
proposed and a few of them were concerned with space. 
The growth process is the same as for BA, but in addition 
one has to specify the location of the new node. In most 
models, the location is taken at random and uniformly 
distributed in space. The attachment probability is then 
written as 



n„^i oc hF[dE{n,i)] 



(157) 



where is a function of the euclidean distance dE{n,i) 
from the node n to the node i. 

When F is a decreasing function of distance (as in most 
cases), this form (Eq. 157) implies that new links pref- 



2. Preferential attachment and distance selection 

Several models including distance were proposed [33, 
104, 177, 221, 271, 282, 310, 325, 329] and we review here 
the main results obtained in these studies. The N nodes 
of the network are supposed to be in a d-dimensional 
space of linear size L and we assume that they are dis- 
tributed randomly in space with uniform density p. One 
could use other distributions: For instance in cities the 
density decreases exponentially from the center [84] . The 
case of randomly distributed points is interesting since on 
average it preserves natural symmetries such as transla- 
tional and rotational invariance in contrast with lattices. 

Essentially two different cases were considered in the 
literature. 



a. Finite range case. In this case, the function F{d) de- 
creases sharply with distance, typically as an exponential 

[331 



F{d) 



-d/ra 



(158) 



and thus introduces a new scale in the system, the in- 
teraction range r^. When the interaction range is of the 
order of the system size (or larger), the distance is ir- 
relevant and the obtained network will be scale-free. In 
contrast, when the interaction range is small compared to 
the system size, we expect new properties and a crossover 
between these two regimes. 

The important dimensionless parameters are here the 
average number n of points in a sphere of radius Vc 



rd/2 



n — pr^ 



r(i 



(159) 



and the ratio which controls the importance of spatial 
effects 



rc 
L 



(160) 



(where L is the system size). It can then be shown [33] 
that the degree distribution follows the scaling form valid 
only for ?7 ^ 1 



p(fc) ^ k-y{k/k,) 



(161) 
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with 7 = 3 and where the cut-off kc behaves as 



kr 



(162) 



where /3 ~ 0.13. The distance effect thus Umits the choice 
of available connections and thereby limits the degree 
distribution for large values. 

Also, when the distance effect is important we expect 
a large value of the average clustering coefficient. In the 
limit of small ?7, we can expect the result of random ge- 
ometric graphs (see section IV. A. 1 and [104]) to hold 



Co EE (C)(7] = 0) = 1 - 3\/3/47r ~ 0.59 



(163) 



(for d — 2). We expect to recover this limit for 77 —> and 
for an average connectivity (fc) — 6 which is a well-known 
result in random geometry. If -q is not too small, the 
preferential attachment is important and induces some 
dependence of the clustering coefficient on n. In addition, 
we expect that {C){r]) will be lower than (C)(0) since the 
longer links will not connect to the nearest neighbors. 
Numerical results show that there is a good collapse (see 
Fig. 46a) when {C){rf) is expressed in terms of n and is 
a decreasing function since when n increases the number 
of neighbors of the node will increase thus decreasing the 
probability that two of them will be linked. 

The average shortest path is described by a scaling 
ansatz which governs the crossover from a spatial to a 
scale-free network 



{l{N,7i))^[N*(r^)r^, 



N 



with 



$„(a;) 




for a; <C 1 
for X ^ 1 



(164) 



(165) 



The typical size N* depends on 77 and we can find its 
behavior in two extreme cases. For r/ 3> 1, space is irrel- 
evant and 



iV*(?7>l)-iVo 



(166) 



where A^o is a finite constant. When 77 <C 1, the exis- 
tence of long-range links will determine the behavior of 
{£). If we denote by a = l/p^^'^ the typical inter- node 
distance, the transition from a large to a small-world will 
be observed for Tc ~ a{N*) which leads to 



(167) 



In Fig. 46b, the ansatz Eq. (164) together with the results 
Eqs. (166), (167) is shown. This data collapse is obtained 
for a ~ 0.3 and A^o ~ 180 (for d = 2). For A^ > A^*, the 
network is a small-world: the diameter is growing with 
the number of points as {£) ^ log A^. In the opposite case 
of the spatial network with a small interaction range. 



0.8 



0.6 



U 0.4 



0.2 



(a) 



Ti=mo 

11=1/100 

Ti=moo 

11=1/300 
Tl=l/400 
11=1/600 







10"' 10^ 10"' 10"' 10° 10' 10' 10' 10^ 




FIG. 46 (a) Clustering coefficient versus the mean number 
n = pTirl of points in the disk of radius (plotted in Log- 
Lin) . The dashed line corresponds to the theoretical value Co 
computed when a vertex connects to its adjacent neighbors 
without preferential attachment, (b) Data collapse for the 
average shortest path obtained. The first part can be fitted 
by a power-law with exponent « 0.3 followed by a logarithmic 
regime for N > N* . Both figures from [33]. 



the network is much larger: To go from a point A to 
a point B, we essentially have to pass through most of 
the points in between and the behavior of this network 
is much that of a lattice with {£) ~ A^", although the 
diameter is here smaller probably due to the existence of 
some rare longer links (in the case of a lattice we expect 
a = 1/d). Probably larger networks and better statistics 
are needed here. 

This model was extended [31] in the case of weighted 
growing networks in a two-dimensional geometrical 
space. The model considered consists of growth and the 
probability that a new site connects to a node i is given 
by 



(168) 



where is a typical scale and dni is the Euclidean dis- 
tance between n and i. This rule of strength driven 
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FIG. 47 Distance strength versus k for two different cases 
(j] = 0.02 and r\ = 10.0). When r\ is not small, space is irrele- 
vant and there are no correlations between degree and space 
as signalled by /3d « 1. When spatial effects are important 
(r; = 0.02 ^ 1), non-linear correlations appear and /3tj > 1. 
We observe a crossover for fe ~ 10 — 20 to a power-law be- 
havior and the power-law fit over this range of values of k is 
shown (full lines). From [31]. 
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FIG. 48 Average Euclidean distance between the barycenter 
G of all nodes and the 10 most central nodes (C) versus the 
parameter t) (Here 5 = 0, A'^ = 5, 000 and the results are 
averaged over 50 configurations). When space is important 
(ie. small r\), the central nodes are closer to the gravity center. 
For large space is irrelevant and the average distance tends 
to the value corresponding to a uniform distribution (r)„„i/ = 
2/3 (dotted line). From [31]. 



'preferential attachment with spatial selection, general- 
izes the preferential attachment mechanism driven by the 
strength to spatial networks. Here, new vertices connect 
more likely to vertices which correspond to the best in- 
terplay between Euclidean distance and strength. 

The weights are also updated according to the follow- 
ing rule already studied in another paper [30] 

+(5^. (169) 

for all neighbors j € T{i) of i. 

The model contains thus two relevant parameters: the 
ratio between the typical scale and the size of the system 
r] = rc/L, and the ability to redistribute weights, S. 

The most important results concerning the traffic are 
the following. The correlations appearing between traffic 
and topology of the network are largely affected by space 
as the value of the exponents /S^ and 13^ depend on 77 
(for /3d see Fig. 47). Strikingly, the effect of the spatial 
constraint is to increase both exponents /3w and f5d to 
values larger than 1 and although the redistribution of 
the weights is linear, non-linear relations s^{k) and s'^{k) 
as a function of k appear. For the weight strength the 
effect is not very pronounced with an exponent of order 
Puj ~ 1-1 for 77 = 0.01, while for the distance strength 
the non-linearity has an exponent of order (3^ « 1.27 for 
ri = 0.02. 

The nonlinearity induced by the spatial structure can 
be explained by the following mechanism affecting the 
network growth. The increase of spatial constraints af- 
fects the trend to form global hubs, since long distance 
connections are less probable, and drives the topology to- 
wards the existence of 'regional' hubs of smaller degree. 
The total traffic however is not changed with respect to 



the case 77 = 00, and is in fact directed towards these re- 
gional hubs. These medium-large degree vertices there- 
fore carry a much larger traffic than they would do if 
global hubs were available, leading to a faster increase of 
the traffic as a function of the degree, eventually resulting 
in a super-linear behavior. Moreover, as previously men- 
tioned, the increase in distance costs implies that long 
range connections can be established only towards the 
hubs of the system: this effect naturally leads to a super- 
linear accumulation of s''(fc) at larger degree values. 

The spatial constraints act at both local and global 
level of the network structure by introducing a distance 
cost in the establishment of connections. It is therefore 
important to look at the effect of space in global topolog- 
ical quantities such as the betweenness centrality. Hubs 
are natural crossroads for paths and it is natural to ob- 
serve a correlation between g and k as expressed in the 
general relation g{k) ~ A:^ where the exponent depends 
on the characteristics of the network and we expect this 
relation to be altered when spatial constraints become 
important (see section II. B. 2). In particular, the be- 
tweenness centrality displays relative fluctuations which 
increase as 77 decreases and become quite large. This 
can be understood by noticing that the probability to es- 
tablish far-reaching short-cuts decreases exponentially in 
Eq. (168) and only the large traffic of hubs can compen- 
sate this decay. Far-away geographical regions can thus 
only be linked by edges connected to large degree vertices, 
which implies a more central role for these hubs. The ex- 
istence of fluctuations means that nodes with small de- 
gree may have a relatively large betweenness centrality 
(or the opposite), as observed in the air-transportation 
network (see Fig. 10 and [146]). The present model de- 
fines an intermediate situation in that we have a random 
network with space constraints that introduces a local 
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structure since short distance connections are favored. 
Shortcuts and long distance hops are present along with 
a spatial local structure that clusters spatially neighbor- 
ing vertices. In Fig. 48 we plot the average distance 
d{G, C) between the barycenter G and the 10 most cen- 
tral nodes. As expected, as spatial constraints become 
more important, the most central nodes get closer to the 
spatial barycenter of the network. 



b. Power law decay of F{d). 
in Eq. (157) is varying as 



In this case, the function F 



Fid) = 



(170) 



This problem was considered in [221, 325, 329]. The nu- 
merical study presented in [325] shows that in the two- 
dimensional case, for all values of a the average shortest 
path behaves as logA^. The degree distribution is how- 
ever different for a > —1 where it is broad, while for 
a < —1, it is decreasing much faster (the numerical re- 
sults in [325] suggest according to a stretched exponen- 
tial). 

In [221], Manna and Sen study the same model but 
for various dimensions and for values of a going from 
— cx) to -|-oo where the node connects to the closest and 
the farthest node, respectively (Fig. 49). These authors 
indeed find that ii a > the network is scale-free and in 
agreement with [325] that ac(c^ = 2) = — 1 while for large 
dimensions ac decreases with d (the natural guess ckc — 
1 — d is not fully supported by their simulations). This 
study was complemented by another one by the same 
authors [281] in the d = 1 case and where the probability 
to connect to a node i is given by (which was already 
proposed in [329]) 



fcfd£(n,z)" 



(171) 



For a > etc = —0.5 the network is scale-free aX f3 — 1 with 
an exponent 7 = 3. They also find a scale-free network 
for a line in the a — /3 plane and also for f3 > 1 and 
a < —0.5. The degree-dependent clustering coefficient 
C(fc) behaves as 



C{k) - fc- 



(172) 



where the authors found numerically that b varies from 
to 1 (which is the value obtained in the BA case). 



3. Growth and local optimization 

a. A model for the Internet. In the Internet, the degree 
distribution is a power law (see for example [255] and 
references therein) and in order to explain this fact, com- 
puter scientists [126] proposed a tree growth model with 
local optimization. More precisely, at each unit time, a 
new node i is connected to an already connected node j 



a= 



a = -l 




a=-2 



a= -00 



FIG. 49 Various networks obtained with the rule F{d) — cP 
From [221]. 



such that the quantity 



£ = XdE{i,j) + hj 



(173) 



is minimum. The quantity hj is a measure of central- 
ity which can be either the average shortest path length 
from j, or the shortest path length to a root node, etc. A 
new node would then typically like to connect to a very 
central node but is limited by the cost measured by the 
distance. There will thus be an interplay between these 
two constraints and we can explore the two extremes: for 
small A, we obtain a star network. In the opposite case, 
when A is large, only distance is important and we obtain 
some sort of dynamical version of the Euclidean mini- 
mum spanning tree. If A has some intermediate values 
(A growing slower than v'iV and is larger than a certain 
constant), we can obtain a network with a power law de- 
gree distribution whose exponent 7 depends on A. For 
instance if A ~ N^^^ then 7=1/6 (see [126] for other 
results on this model). 
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Even if the resulting network is not obtained by min- 
imizing some global function, the addition of new nodes 
minimizes a cost function and in this sense we can speak 
of a local minimization. 



b. A model For distribution networks. Many networks, in- 
cluding transportation and distribution networks evolved 
in time and increased their service area. Clearly, in these 
situations the resulting networks are growing and cannot 
result from a global optimization but instead, local opti- 
mization could be a reasonable mechanism for explaining 
the organization of these structures. 

In the example of a transportation network such as the 
train system, the nodes represent the train stations and 
the edges the rail segments between adjacent stations. 
In many of these systems, there is also a root node which 
acts as a source of the distribution system or in the case 
of the railway can be considered to be the central station. 
During the evolution of the network at least two factors 
could be considered. First, the total length of the system 
which represents the cost of the infrastructure should not 
be too large. Space has another important role here: 
the transportation system should also allow to connect 
two nodes in the network through a shortest path whose 
length is not too far from the 'as crow flies' distance. This 
efficiency is for example measured by the route factor -or 
detour index - (see section II.B.2.c) which for two nodes 
i and j of the network reads 



(see Fig. 50). In this figure, we see that the route factor 



Qihj) 



For a system with a root node 0, one can then compute 
the route factor as the average over all nodes except 



-y 



(174) 



Following these two requirements, Gastner and New- 
man [135] proposed a model of a growing network where 
vertices are initially randomly distributed in the two- 
dimensional plane and where one vertex is designated 
as the root node 0. A network is then grown from its 
root by adding an edge between an unconnected node i 
to a vertex j which belongs to the network. The edge 
is chosen according to a local minimization process such 
that the quantity 



dEjiJ) + rffl(j, 0) 
dEihO) 



(175) 



is minimum and where a > is here a parameter con- 
trolling the importance of the route factor. For a = 0, 
the algorithm adds always a link to the closest vertex 
and the resulting network is similar to the MST and has 
a poor route factor [135]. When a increases the route 
factor decreases and the average edge length J (not be 
confused with the average shortest path {£)) increases 
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FIG. 50 Route factor q and average edge length I versus a 
in Eq. (175). These results are obtained for A'^ = 10'' vertices 
and in the inset the network is obtained for a = 12. From 
[135]. 

q decreases sharply when a increases from zero, while the 
average edge length -which is a measure of the building 
cost of the network- increases slowly. This suggests that 
it is possible to grow networks with a small cost but with 
a good efficiency. 

Gastner and Newman [135] also studied a simpler ver- 
sion of this model where the local minimization acts on 
the quantity 



^ u — dh 



{hj) + Pdji{j,0) 
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which is similar to the model for the Internet proposed 
by Fabrikant, Koutsoupias, Papadimitriou [126] (see also 
section IV.D.3.a) but where the vertices are added one by 
one. This model produces networks similar to the one ob- 
tained with Eq. (175) and self-organizes to networks with 
small q which is not imposed here. This model can be ap- 
plied to the set of stations of the Boston rail network and 
produces a network in good correspondence with the real 
one. Also, the small value of q is confirmed in different 
empirical examples such as sewer systems, gas pipelines, 
and the Boston subway [135] where the ratio 1/Imst is 
in the range [1.12, 1.63] while the route factor is less than 
1.6 (and compared to the MST is improved by a factor 
in the range [1.4, 1.8]). 

The networks obtained here are trees which is a simpli- 
fication for many of the real-world networks which usu- 
ally contain loops. In addition, there is also usually an 
interaction between the density of points and the net- 
work and this co-evolution is not taken into account here. 
However, this simple model of local optimization seems to 
capture important ingredients and could probably serve 
as a good starting point for further improvements. 
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FIG. 51 M is the closest network's point to both centers A 
and B. The road will grow to point M' in order to maxi- 
mally reduce the cumulative distance A of A and B from the 
network. From [39]. 



c. A model for street and road networks. In the case of ur- 
ban street patterns, striking statistical regularities across 
different cities have been recently empirically found, sug- 
gesting that a general and detail-independent mechanism 
may be in action (see section III.B.l). The rationale to 
invoke a local optimality principle in this context is as in 
the case of distribution networks based on costs: every 
new road is built to connect a new location to the existing 
road network in the most efficient way [47]. The locality 
of the rule is implemented both in time and space during 
the evolution and formation of the street network, in or- 
der to reflect evolution histories that greatly exceed the 
time-horizon of planners. The self-organized pattern of 
streets emerges as a consequence of the interplay of the 
geometrical disorder and the local rules of optimality. 

This local optimization process proposed in [39] can be 
understood with the simple following example (Fig. 51). 
We assume that at a given stage of the evolution, two 
nodes A and B still need to be connected to the network. 
At any time step, each new node can trigger the con- 
struction of a single new portion of road of fixed (small) 
length. In order to maximally reduce their distance to 
the network, both A and B would select the closest points 
Ml and M2 in the network as initial points of the new 
portions of roads to be built. If Mi and M2 are distinct, 
segments of roads are added along the straight lines Mi A 
and M2B. If Ml = M2 = M, it is not economically rea- 
sonable to build two different segments of roads and in 
this case only one single portion MM' of road is allowed. 
The main assumption here is that the best choice is to 
build it in order to maximize the reduction of the cumu- 
lative distance A from M to A and B 

A^[d{M,A) + d{M,B)]-[d(M',A)+d(M',B)] (177) 

The maximization of A is done under the constraint 
]MM'] = const. <^ 1 and a simple calculation leads to 



MM' (X UA + ub 



(178) 



where ua and ub are the unit vectors from M in the di- 
rection of A and B respectively. The rule (178) can easily 
be extended to the situation where more than two centers 
want to connect to the same point M. Already in this 
simple setting non-trivial geometrical features appear. In 



the example of Fig. 51 the road from M will develop a 
bended shape until it reaches the line AB and intersects 
it perpendicularly as it is commonly observed in most 
urban settings. At the intersection point, a singularity 
occurs with ua + ub ~ and one is then forced to grow 
two independent roads from the intersection to A and B. 

Interestingly, we note that although the minimum ex- 
penditure principle was not used, the rule Eq. (178) was 
also proposed by Runions et al. [273] in a study about 
leaf venation patterns. 

The growth scheme described so far leads to tree-like 
structures which are on one side economical, but which 
are hardly efficient. For example the path length along 
a minimum spanning tree scales as a power 5/4 of the 
Euclidean distance between the end-points [97, 116] and 
better accessibility is granted if loops are present. The 
authors of [39, 40] followed [273] and assumed that a new 
node can trigger the construction of more than one por- 
tion of road per time step, leading to the existence of 
loops. This model produces realistic results (some ex- 




FIG. 52 Snapshots of the network at different times of its 
evolution: for (a) t = 100, (b) t = 500, (c) t = 2000, (d) 
t — 4000 (the growth rate is here 77 = 0.1). At short times, 
we have almost a tree structure and loops appear for larger 
density values obtained at larger times (the number of loops 
then increases linearly with time). From [39]. 

amples are shown in Fig. 52), which are in good agree- 
ment with empirical data which demonstrates that even 
in the absence of a well defined blueprint, non-trivial 
global properties emerge. This agreement confirms that 
the simple local optimization is a good candidate for the 
main process driving the evolution of city street patterns 
but it also shows that the spatial distribution of nodes 
p(r) is crucial. Concerning this last aspect, a more gen- 
eral model describing the co-evolution of the node distri- 
bution p{r) and the network was proposed in [40]. 

Another interesting model was proposed recently by 
Courtat, Gloaguen and Douady [100]. In this model, 
each new 'settlement' is added every time step at a cer- 
tain location and connects to the existing infrastructure 
network in a way similar to [39]. The city generates a 
spatial field describing the 'attractiveness' of every point. 
This potential field has a hard repulsion term at short dis- 
tances and a large distance behavior proportional to the 
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total 'mass' of the city (measured by the total length of 
its roads) and decreasing as a power law with distance. 
The sum of all influences of all roads then produce local 
minima and each new node has its own policy. Among 
others, the parameter lu is related to the 'wealth' of the 
node and controls the number of roads constructed to 
connect a new settlements: if a; = 1 all possible roads 
are constructed and in the opposite case w = 0, only the 
shortest road is built. Another important parameter in 
this model is the probability to be at the optimal loca- 
tion controlled by the potential. For small Pf. the city is 
'unorganized' and nodes are added randomly, while for 
large Pe the probability to stick to the optimal location is 
large. In Fig. 53, we show the We show in Fig. 53 differ- 




FIG. 53 Simulations of the model proposed by Courtat, 
Gloaguen, Douady [100]. The different networks are obtained 
for various values of the parameters Pe and u). At small values 
of Pe, the city is organized while for large Pe, the nodes are 
added at random. For small uj, a minimum number of roads 
are constructed to connect new nodes while in the opposite 
cases all possible connections are added. From [100]. 

ent networks obtained by varying the two parameters Pe 
and w. The variety of networks obtained by varying two 
parameters only is remarkable - going from a 'favella-like' 
organization (for large Pe) to a highly organized area (for 
small Pe)- and suggests that this approach could provide 
an interesting first step for the modelling of urban street 
networks. 

Finally, we end this section by noting that there are 
many other models for roads and urban structures and 
that we focused here on a physicist-like approach with 
minimal models. In particular, there are many studies 
on these problems done by geographers (with usually the 
help of cellular automata) and we refer the interested 
reader to articles in the handbook [44] for more refer- 
ences. 



E. Optimal networks 

Although one of the main pillars in complex systems 
studies is the emergence of a collective behavior without 
any central planning, it is usually a matter of time scale 
compared with the typical time horizon of planners. On 
a short time scale it is reasonable to assume that plan- 
ning plays a role and that the system under consideration 
evolves by using an optimization process. On a larger 
time scale, most systems result from the addition of suc- 
cessive layers and even if each of these layers is the result 
of an optimization process, it is very likely that the long 
time result is not an optimum. 

Optimization is however of great importance in many 
practical engineering problems and both the problem of 
optimal networks [180] and of optimal traffic on a net- 
work [4, 309] have a long tradition in mathematics and 
physics. It is well known, for example that the laws that 
describe the flow of currents in a resistor network [193] 
can be derived by minimizing the energy dissipated by 
the network [115]. On the other hand, optimal networks 
have been shown to be relevant in the study of mam- 
malians circulatory system [228], food webs [132], general 
transportation networks [26], metabolic rates [317], river 
networks [223], and gas pipelines or train tracks [134]. 
All these studies share the fact that the nodes of the net- 
work are embedded in a d-dimensional euclidean space 
which implies that the degree is almost always limited 
and the connections restricted to 'neighbors' only. 

A second broad class of optimal networks where spa- 
tial constraints are absent has been also recently investi- 
gated. It has been shown, for example, that optimization 
of both the average shortest path and the total length 
can lead to small- world networks [226], and more gener- 
ally, degree correlations [48] or scale-free features [302] 
can emerge from an optimization process. Cancho and 
Sole [169] showed that the minimization of the average 
shortest path and the link density leads to a variety of 
networks including exponential-like graphs and scale-free 
networks. Guimera et al. [148] studied networks with 
minimal search cost and found two classes of networks: 
star-like and homogeneous networks. Finally, Colizza et 
al. [90] studied networks with the shortest route and the 
smallest congestion and showed that this interplay could 
lead to a variety of networks when the number of links 
per node is changed. 

Finally, we note that optimal networks could serve as 
interesting null models. For example, the (Euclidean) 
minimum spanning tree (MST) is obtained by connect- 
ing all the nodes with a minimum total length and as an 
illustration we show in Fig. 54 the MST obtained for the 
set of stations present in the Paris subway in 2009. In 
this subway case the total length is directly connected to 
the cost of the network and the MST represents the most 
economical network. The MST has however some draw- 
backs such as a large average shortest path and a large 
vulnerability to failure, and the real subway network has 
obviously many redundant links. It is however interesting 
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FIG. 54 Top: Paris subway network (2009). Bottom: Cor- 
responding Minimum spanning tree. 



to understand the interplay between costs and efRciency 
by comparing the actual network with the MST. 

As we can see optimal networks appear in many dif- 
ferent branches such as mathematics, physics and also in 
engineering. This subject is in fact so broad that it would 
deserve a whole review to explore its various aspects. In 
this chapter, we thus made the choice to restrict our- 
selves to the most recent and relevant statistical studies 
involving optimal networks and space. 



1. Hub-and-spoke structure 

An important example of the result of an optimization 
process is the hub-and-spoke structure. In this structure, 
direct connections are replaced with fewer connections 
to hubs which form a network at a larger scale. The 
hub-and-spoke structure reduces the network costs, cen- 
tralizes the handling and sorting, and allows carriers to 
take advantage of scale economies through consolidation 
of flows. Such networks have widespread application in 
transportation and became in particular pregnant in air- 
line transportation and has its cause mainly in the fact 
that a carrier has to minimize its costs even if by doing 
so the average traveling time for user is not minimized 
(see the Fig. 55 for an illustration of this difference of 
designs). 

One of the first cases where the hub-and-spoke system 
was observed is in the US with Delta Air Lines which 
had its hub in Atlanta (GA). FedEx also adopted this 




User-oriented design Construction -oriented design 

FIG. 55 Schematic comparison between a transportation net- 
work which optimizes the user travel time and distance (left) 
and network which minimizes the construction costs (right). 

system in the 1970s and after the airline deregulation in 
1978 most of the airline carriers adopted it. This system 
is based on the construction of regional hubs and to the 
creation of major routes between these hubs (see Fig. 56). 
One can easily compare the point-to-point (P2P) transit 




Point-to-point Hub-and-spoke 



FIG. 56 Schematic comparison between a point-to-point 
transit model which optimizes the user travel time and dis- 
tance (left) and a hub-and-spoke system (with two hubs 
Hi and H2) which minimizes the total operating costs by 
bundling together the traffic between the two hubs. 

model and the one hub-and-spoke model for N nodes. 
In the hub-and-spoke model, there are of the order N 
routes to connect all nodes, while in the P2P the num- 
ber of routes it is of the order N'^. In transportation 
systems such as airlines, this will lead to fuller planes, 
larger benefits, and expensive operations such as sorting 
and accounting packages will be done at the hub only. 
Obviously, there are also drawbacks. The centrality of 
the hubs is very large and overloading at a hub can have 
unexpected consequences throughout the whole network. 
In other words, the hub is a bottleneck in the network 
and in agreement with general results on networks, hubs 
also constitute the vulnerable points in the network, a 
failure at a hub being able to trigger large delays which 
can propagate in the whole network. 

An important problem in transportation research is to 
locate the hub(s) in a system of N nodes such that a 
given cost function is minimum. Analytical research on 
this hub location problem began relatively recently with 
the study [252] and the interested reader can find a short 
review on this subject in [65]. In the following, we will 
detail one of the simplest case, known as the linearized 
multiple assignement model and which can be formulated 
as follows. We assume that in a system of TV nodes, there 
is a known origin-destination matrix Tij (ie. the flow 
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between i and j, see section III.C.l). We denote by Cij 
the transportation cost per individual to go from i to j 
and the importance of hubs is in the fact that there is 
a discount factor < a < 1 on the hnks among them. 
For example in the airline case, bundling flows allows to 
use for example larger aircrafts and reduce the passenger- 
mile costs. This discount factor implies that if k and m 
are two hubs, then the interhub cost on fc — m is given 
by aCkrn- If we denote by Xij^km the fraction of the flow 
from i to j which is routed via the hubs k and m, the 
function to be minimized is then 

£ — Tij[Cik + CtCkm + Cmj\Xij^krn (179) 

ijkTTi 

under the constraints 

Y^Zk^p (180) 
fc 

^-^ij,fcm = l (181) 
km, 

Zfc < (182) 

m 

Xjj^km — Z„i < (183) 

fc 

where Zj. = 1 if fc is a hub and zero otherwise. We look for 
a solution of this problem with a fixed number p of hubs. 
The two last constraints ensure that the flow will not be 
routed via fc and m unless both fc and m are hubs. This 
type of minimization will lead to solutions schematically 
shown in Fig. 56 where the reduction of the number of 
links is made possible by the establishment of hubs and 
the bundling of flows between the hubs. For these op- 
timal networks, the total network cost is minimum but 
obviously individual travel times are larger (as compared 
with a point-to-point network). 

We just gave the flavor here of this type of approaches 
and there is a huge amount of studies on this prob- 
lem that we cannot cite here because of lack of space. 
Basically, there is a more practical research direction 
which amounts to add more characteristics of real-world 
networks [65] such as transfer delays for example [253]. 
There is also a more mathematical point of view (see for 
example [11]) where optimal properties are discussed. At 
this point however, many problems are still unsolved and 
a statistical approach on the large network properties of 
this hub-and-spoke minimization problem is probably an 
interesting direction for future research. 



2. Congestion and centralized organization 

In many real-world cases however the pure hub-and- 
spoke structure is not present and we observe a ring struc- 
ture around a complicated core or an effective hub (see 
for example Fig. 57). An interesting discussion on cen- 
tralization versus decentralization from the perspectives 



of the minimum average shortest path and of the effect 
of congestion can be found in [19, 171]. The idea is then 




FIG. 57 Examples of hub-and-spoke structures with rings. 
(a-c,e): Typical fungi networks, in (c) a schematic represen- 
tation of the nutrient flow is shown, (d) The model studied 
in [19, 112, 171] with spokes radiating from a hub. (f) Road 
network in Houston showing an inner hub with a complicated 
structure. From [171]. 

to study the competition between the centralized orga- 
nization with paths going through a single central hub 
and decentralized paths going along a ring and avoiding 
the central hub in the presence of congestion. A simple 
model of hub-and-spoke structure together with a ring 
was proposed in [112]. In this model, N nodes are on a 
circle and there is hub located at the center of the circle 
(see Fig. 58). A radial link - a spoke - is present with 




FIG. 58 Models proposed in [112] and studied in [19, 171] 
with congestion. A central site is connected to a site on a ring 
with probability p. In (a) all the links on the ring are directed 
and in (b) these links are not directed. From [112]. 

probability p. When computing the shortest path, these 
spokes are assumed to count for 1/2. In other words, the 
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number of hops to go from one site on a ring to another 
via the central hub (when it is possible) is then equal to 
1. In the case of directed links, the shortest path distri- 
bution is easy to compute and one finds [112] 



-[i + {t- i)p + £(iv - 1 - V)/] [i-pY 



P(i) ^ — 

(184) 

In particular, we can check that the two limiting cases 
are correct: for p ^ 0, we obtain P{i) — > 1/iV — 1 giving 
{£) = N/2 and for p ^ 1, P{t} (J^j and {tj = 1. The 
general expression for the average shortest path is 

JSl — 1 \_ p p-^ p p p 

(185) 

In the undirected case, we have more paths going from 
one site to the other and the enumeration is a little bit 
more tedious. The result for the shortest path distribu- 
tion now reads [112] 



P{t = 1) = 
P{£ >2)^ 



N - 1 
1 



1 



TV- 3 



-P 



2 3 (186) 
ao -I- aip + a2P + a^p 



N- 1 
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where 



+ aip^]{l~py 



ao = 2 

ai ^ A{i-2) 

02 = 2(^-l)(2iV-4^-3) 
ag = -2(2£- l)(iV- 2^- 1) 
04 = 1{N -2e~l) 



(187) 



For both these models a continuous limit can be defined 
by taking the limit N ^ oo and p — > with p = pN 
and z = £/N fixed. The shortest path distribution then 
converges to (in the undirected model) 



NP{e) ^ 2[1 + 2pz + 2p2(l - 2z)]e 



-2pz 



(188) 



The interesting observation made in [19] is that if we now 
add a cost c each time a path goes through the central 
hub, we could expect some sort of transition between a 
decentralized regime where it is less costly to stay on the 
peripheral ring to a centralized regime where the cost is 
not enough to divert the paths from the central hub. The 
cost could in general depend on how busy the center is 
and could therefore grow with the number of connections 
to the hub. In the case of a constant cost c, the expression 
for the average shortest path is now [19] 



P(£) = < 

^ ' ^ 7^A^ + bip + hp^]{i-pY 



N- 



for e<c 
for £> c 
(189) 



where 6i = £ - c - 1 and 62 = (iV - 1 - £)(^ - c). For 
paths of length i < c, there is no point to go through 
the central hub. In the opposite case, when ^ > c, we 
recover a distribution similar to the c = case in [112]. 
The average shortest path is then 



{l-p^-^iS + iN 



c)p] 



p2(iV- 1) 

p[2 -2c + 2N -{c - l)(c - N)p\ 



p^{N -1) 



c(c-l) 
2(A^- 1) 
(190) 



In the continuous limit (p — 0, iV — > 00 and z = i/N and 
p — pN fixed) , the average shortest path is a function of 
the various parameters {£) = {£) {p, c, N) . In the case of 
costs increasing linearly with p, the average shortest path 
displays a minimum when p is varied {N and c being 
fixed). Indeed for p — >• 0, there are no spokes and {£) 
scales as A^. In the opposite case p large the cost is also 
large and it is less costly to go along the ring. In [19], 
the authors used a simple approximation and found that 
(with c = kp), the optimal value of p is 



P 



a result that is confirmed numerically, 
actually be rewritten as 



(191) 



This result can 



pc{p) 



1 



(192) 



which means that the optimal situation is obtained when 
the average cost of a radial trip through the central hub 
is of order one: when c is too large, this trip is too costly 
and when p is too small, the existence of this path is too 
unlikely. The same argument applied to nonlinear cost 
c ~ kp^ gives the scaling 



(N/k) 



1/3 



(193) 



For this optimal value, the minimum average shortest 
path is then of the order the cost 



{£)min ~ C(p*) 

In the linear case c — kp, one obtains 



(194) 



~ VkN (195) 
and in the nonlinear case c ^ kp^, one obtains 

{£)m^n ^ {kN'^f^ (196) 

These expressions and arguments apply essentially both 
to the directed and non-directed model. 

This study [19] was generalized in [171] to the case of a 
more complicated cost function such as c(p) = Cp+Bp'^ + 
Ap^ where the authors observe different behaviors and a 
phase transition according to the values of the coefficients 



A, B, and C. 

These studies show how congestion could have an im- 
portant impact on the a priori optimal hub-and-spoke 
structure and favor the transport along a ring. From a 
more general perspective it would indeed be interesting 
to observe empirically this transition from a purely radial 
structure and the appearance of a ring for a congestion 
large enough, as it is observed in many urban structures 
for example. 

3. From the MST to the SPT 

There is a obviously a huge literature on optimal net- 
works and we are obliged to restrict ourselves to a small 
subset of subjects. We will discuss here the prototypes 
of optimal networks allowing us to introduce some of the 
main objects such as the minimum spanning tree or the 
shortest path tree. Concerning the topic of connected 
networks over random points which minimize quantities 
such as the total route-length, the interested reader can 
find a recent and modern account about mathematically 
tractable models in a series of papers by Aldous [10-12] 
and in the references therein. 

We will not introduce the different networks one by one 
but we will adopt a different point of view by introducing 
a generalized energy inspired by studies on optimal traf- 
fic networks [38]. The problem is, given a set of N nodes 
located in a 2-dimensional plane, to find the tree connect- 
ing them and which minimizes the following quantity 

= J2 (197) 

eer 

where is the Euclidean length of a link (or it could be 
another measure of its cost) and 6e is the betweenness 
centrality of the link (see section II.B.l.d). The expo- 
nents /i and v control the relative importance of distance 
against topology as measured by centrality. Fig. 59 shows 
examples of spanning trees obtained for different values 
of 

Case (/i, v) — (0, 1). The energy is then 

^0,1 = Y.'^- (198) 

eer 

and represents the total length of the network. In real- 
world cases, the cost of the network is connected to 
this quantity and it makes sense to try to minimize 
it. Since the nodes are in space, the optimal resulting 
tree is called Euclidean MST. An example of the two- 
dimensional MST is shown in the Fig. 59(a) and many 
quantities can be calculated in the limit of a large num- 
ber of nodes. For example, for the 2d case, the length 
M{N) of the longest edge scales as [258] 
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FIG. 59 Different spanning trees obtained for different 
values of (^, u) in Eq. (197) obtained for the same set of 
A'' = 1000 nodes, (a) Minimum spanning tree obtained for 
(/I,!/) = (0, 1). In this case the total distance is minimized, 
(b) Optimal traffic tree obtained for (/i, i^) = (1/2,1/2). In 
this case we have an interplay between centralization and min- 
imum distance resulting in local hubs, (c) Minimum euclidean 
distance tree obtained for (/x, u) = (1, 1). In this case central- 
ity dominates over distance and a 'star' structure emerges 
with a few dominant hubs, (d) Optimal betweenneess cen- 
trality tree obtained for (/i, v) — (1, 0). In this case we obtain 
the shortest path tree which has one star hub (for the sake of 
clarity, we omitted some links in this last figure). From [38]. 

which is larger by a non-trivial factor ViniV than the 
typical internode distance which scales as 1/^/N. Also, 
the total length £t{N) of the c?-dimensional MST scales 
as [292] 

£j.(iV) ^ ATi-i/rf /" p{x)'^dx (200) 

where p{x) denotes the point distribution in M''. For a 
uniform distribution in rf = 2, we thus obtain a scaling of 
the form £t{N) ^ \/N. This scaling can be understood 
with the following simple argument. In the case of a 
uniform density p, the typical inter-node distance is given 
by Xjp^l'^ which scales as XjN^I'^ and we have E (x N 
links which reproduces the scaling £t{N) ^ N x N^^^'''. 

We note here that an interesting study [320] shows that 
the minimum spanning tree for weighted networks can be 
partitioned into two distinct components: the 'superhigh- 
ways' with large betweenness centrality nodes and the 
'roads' with lower centrality. It would be interesting to 
study the consequences of these results for the euclidean 
minimum spanning tree. 

There are many other studies on the MST and it is 
impossible to quote them all here. We just end this para- 
graph by noting that the MST has many interesting con- 
nections with other problems of statistical physics such 
as invasion percolation, directed polymers in random me- 
dia, random resistor networks and spin glasses (see for 
example [110, 170, 264]). 



(199) 



Case {fJ,,iy) = (1/2, 1/2). We obtain here the optimal 
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FIG. 60 Betweenness centrality distribution for the MST 
and for the OTT. The hnes are power law fits and give for 
the MST the theoretical resuh 5 = 4/3 and for the OTT the 
value S ~ 2.0 (A^ = 10"*, 100 configurations). From [38]. 




FIG. 61 Hierarchical organization emerging for the optimal 
traffic tree {fi,u) = (1/2, 1/2) (iV = 10^ nodes). Longer links 
lead to regional hubs which in turn connect to smaller hubs 
distributing traffic in smaller regions. From [38]. 



traffic tree (OTT) shown in Fig. 59(b) and which displays 
an interesting interplay between distance and shortest 
path minimization [38]. It has been shown that trees 
can be classified in 'universality classes' [222, 297] ac- 
cording to the size distribution of the two parts in which 
a tree can be divided by removing a link (or the sub- 
basins areas distribution in the language of river net- 
work). We define Ai and Aj as the sizes of the two parts 
in which a generic tree is divided by removing the link 
The betweenness bij of link (i,j) can be written 
as bij = l[A^{N - Ai) + Aj{N - Aj)], and the distri- 
butions of A' s and b's can be easily derived one from 
the other. It is therefore not surprising that the same 
exponent S characterizes both P{A) ~ A~^ and P{b). 
While we obtain the value S = 4/3 for the MST [297], 
for the OTT we obtain (Fig. 60) an exponent 5 ~ 2, 
a value also obtained for trees grown with preferential 
attachment mechanism [7] (see also [265] for a support- 
ing argument). Interestingly, most real- world networks 
are also described by this value 6 ~2 [139]. The OTT 
thus tends to have a more uniform centrality with re- 
spect to the MST [320] , with important consequences on 
the vulnerability of the network since there is no clearly 
designated 'Achille's heel' for the OTT. 

The traffic properties of the OTT are also interesting 
as the traffic scales as Tij ~ djj with r « 1.5 showing that 
large traffic is carried over large distance and is then dis- 
patched on smaller hubs that distribute it on still smaller 
regions. Despite the limited range of degrees, we also 
observe for the strength [29] Si = J^j "^ij * superlinear 
behavior with the degree. This result demonstrates that 
the existence of degree-traffic correlations as observed for 
the airport network [29] could emerge from a global opti- 
mization process. The spatial properties of the OTT are 
also remarkable and displays (Fig. 61) a hierarchical spa- 
tial organization where long links connect regional hubs, 
that, in turn are connected to sub-regional hubs, etc. 

Case {^i^v) — (1,1). The energy is then given by 



bede which is proportional to the average shortest 
path with weights given by the euclidean distance. If 
we think of the betweenness centrality as a proxy for the 
traffic, the energy also represents the total length trav- 
eled on the system. An example of such a tree is shown 
in Fig. 59(c). 

Case (/i, ly) ~ (1, 0). In this case, the energy Eq. (197) 

is proportional to the average betweenness centrality 
which is also the average shortest path °^ ^- 

tree (1,0) shown in Fig. 59d is thus the shortest path tree 
(SPT) with an arbitrary 'star-like' hub (a small non zero 
value of I' would select as the star the closest node to the 
gravity center). The total length here scales as 

iriN) - N (201) 

with a prefactor proportional to the linear size of the 
system. Also, we note that if we were not restricted to 
trees, the solution would of course be the complete graph 
which gives an average shortest minimal and equal to one. 

The minimization of Eq. (197) thus provides a natural 
interpolation between the MST and the SPT, a prob- 
lem which was addressed in previous studies [190]. The 
degree distribution for all cases considered above (with 
the possible exception of (/z, v) ~ (1, 1) - a complete in- 
spection of the plane (/z, v) is still lacking) is not broad, 
possibly as a consequence of spatial constraints. 

a. The Steiner problem. More generally, finding a sub- 
graph that optimizes a global cost function is a very im- 
portant problem. The Steiner tree is one of the classical 
(NP-complete) problem which is still studied now [45] 
because of its importance in many fields such as network 
reconstruction in biology, Internet multicasting, circuit 
and distribution network design. It is defined as follow- 
ing: given a graph with positive weights, it consists in 
finding a connected subgraph of minimum weight that 
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contains a selected set of 'terminal' vertices. Such a sub- 
graph may require the addition of some 'non-terminal' 
nodes which are called Steiner nodes (see Fig. 62 for an 
illustration of the three nodes problem). The difference 
between the Steiner tree problem and the minimum span- 
ning tree problem is that in the Steiner tree problem ex- 
tra intermediate vertices and edges may be added to the 
graph in order to reduce the length of the spanning tree. 



B 



T Steiner node 

ic 



FIG. 62 Example of the Steiner tree for three points A, 
B, and C. The Steiner point S is located at the Fermat 
point of the triangle ABC and minimizes the total distance 
d{S, A) + d{S, B) + d{S, C). The angles are equal {SB, SA) = 
[SA, SC) = [SC, SA) = 2n/3. 



In [45] , the authors show that this problem can be an- 
alyzed with tools " such as the cavity equation - coming 
from statistical physics of disordered, frustrated systems 
(such as spin glasses, see for example [229]) and can lead 
to new optimization algorithms. 

The Steiner problem is obviously important in trans- 
portation science too, where the nodes are considered as 
cities and the Steiner points as junctions. Recent results 
on this topic can be found in [10] where the author pro- 
poses a rigorous study of the network which minimizes 
the quantity 



(202) 



where /3 is a positive exponent, c?e is the length of the 
link e, and where /(e) is a function describing the flow 
on edge e. In this case, the minimum cost can be shown 
to scale as 



with 



a(/?) 




for /3 e]0,l/2] 
for /3 e [1/2,1] 



(203) 



(204) 



There is a transition at /3 = 1/2 which corresponds to a 
change of regime where short links dominate to another 
one where long links dominate [10]. 



4. Adding two antagonistic quantities 

After having briefly reviewed the important optimal 
networks such as the MST and the SPT, we now discuss 
some examples which were recently proposed in the lit- 
erature. In particular, many models of optimal networks 
minimize a functional £ which is the sum of two terms 
which have have antagonistic behavior 



f = Afi + (1 - X)£2 



(205) 



where A G [0, 1] and where £i is for example a decreasing 
function of the number of links and could represent the 
efficiency (such as the average shortest path for example) 
and where £2 is an increasing function of the number of 
links for example and could represent the cost as mea- 
sure by the total length for example (see Fig. 63). This 
interplay will lead in many case to interesting solutions. 



Cost' 




'1 /efficiency' 



Optimum Number of links 



FIG. 63 Schematic representation of many optimal network 
models. The functional to minimize is the sum of two terms 
which behave in opposite way when adding links. This leads 
in general to a non-trivial optimum. 

We will now discuss some simple examples that were 
considered in the literature. Already in the 1970s optimal 
network design was a very active field (see for example 
[52]) and we will focus here on the more recent results. 



a. Link density and average shortest path. Ferrer i Can- 
cho and Sole [169] minimized the simplest form which is 
the combination of the average (normalized by the max- 
imum) shortest path 



(206) 



and the average link density p = 2E/N{N — 1) and reads 

£ = Xd+{1- X)p (207) 

where A E [0,1]. For A = 0, the energy is the density 
and the optimal network is a tree. In the opposite case 
A = 1, the optimal network minimizes the average short- 
est path. For intermediate values of A, Cancho and Sole 
obtained different networks (see Fig. 64) and in particu- 
lar argue that one of the regimes is a scale-free network 
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FIG. 64 Optimal networks obtained with the minimization 
of Eq. (207) for different values of A. The y— axis represents 
the degree entropy H = — X^fcPfclogPfc ^^'^ small for a 
heterogeneous network (and large for a lattice). From [169]. 

with a broad distribution of degrees. They obtained for 
the network denoted by 'B' in Fig. 64 a power law with 
exponent —2, for a network of size N = 100 (obviously 
better statistics on larger networks would be necessary 
to confirm this result, but this is not an obvious task). 
This shows that optimization might be a candidate for 
explaining the emergence of scale-free features. 

b. Total length and average shortest path. Aldous [11] 
studied the network which minimizes the total length 
subject to the average number of hops. More precisely 
the cost associated to a route 7r(i, j) = 6162 . . . e^n going 
from node i to node j is 

C{-k) ^mA + a^de (208) 

where A and a are two constants (in [11], the quantity 
a is chosen to scale as 1/ -\/]V in order to obtain terms of 
the same order of magnitude) . The quantity A measures 
the cost of connections and we expect that the larger it 
is and the smaller the number of hubs (and the longer 
the spokes). The journey time associated to a graph G 
is then proportional to 

T(G)-^minC[^(i,j)] (209) 

(where 7r(i,j) is the set of paths from i to j). The total 
length of the graph is ^t(G) = J2ij AijdE{i, j) and repre- 
sents the cost (if we assume that the cost is proportional 
to the length of connection between two adjacent nodes). 
We thus can have an interplay between the cost repre- 



sented by the total length and the total journey time 
leading to a variety of behaviors. If we constrain the 
routes to have m average hops, then the minimal total 
length scales as [11] 

eriN) - TV^^™) (210) 

where the following bounds are obtained: /3(2) ~ 3/2, 
/3(3) = 13/10, /3(4) = 7/6. The exponent thus decreases 
when m increases which is expected since a smaller av- 
erage number of hops implies a stronger constraint and 
thus a longer network. Conversely, if we require the net- 
work length £t to scale as N, then the average shortest 
path scales as InlniV [11]. 

In [134] an effective length is assigned to each edge 

e,^ = s + {i-s)dE{i,j) (211) 

and the effective distance on a route tt = 6162 ... be- 
tween two nodes is given by 

mS+{l-S)J2de (212) 

which is Aldous' expression Eq. (208) with the correspon- 
dence A -r-> (5 and a ^ (l—S). Gastner and Newman [134] 
then weight the distance with a passenger number Wij 
and minimize a combination of the form ^t(G) +jT{G). 
We note here that this model was also used together with 
a gravity model for computing Wij in [5] where the au- 
thors compared the resulting optimal network and the 
real- world Indian airline network and found a good agree- 
ment. Gastner and Newman found then for example 
optimal networks shown in Fig. 65. In particular, this 




FIG. 65 Optimal networks obtained with the minimization 
of Eq. (212) for different values of 5. From [134]. 

figure illustrates the expected behavior when the con- 
nection cost increases (similar results were obtained with 
the same model in [136]): for 5 = 0, the connection cost 
is zero, only distance counts and the resulting network 
is essentially a minimal length network, such as a pla- 
nar network of roads for example. When S increases, and 
thus the cost associated to connections, we see fewer hubs 
and longer spokes (the limit being one hub and the corre- 
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FIG. 66 Optimal networks obtained with the minimization 
of Eq. (213) for different values of A. From [61]. 

spending network is the SPT). As noted by Aldous [11], it 
would be interesting to obtain more quantitative results 
in order to be able to compare with some real-world data 
for example. 

In [61], Brede also revisited recently this problem in 
terms of a communication infrastructure with 

E = \iT{G) + {I - \){t) (213) 

In a communication context, the total length represents 
the cost associated with the wires and the average short- 
est path is a measure of the communication efficiency. 
The number of links is not limited and the nodes are lo- 
cated on a regular lattice of dimension d with periodic 
boundary conditions. For A = 0, we obtain the complete 
graph which minimizes {(.) and for A = 1 we obtain a min- 
imum spanning tree (and depending on the dimension we 
can obtain a chain or a star network). For intermediate 
values of A, we obtain different networks interpolating 
between these two extremes (see Fig. 66). 



c. Total length and synchronization. A wide range of sys- 
tems can be described by a system of oscillators coupled 
through a network of contacts and synchronization prob- 
lems have attracted a lot of attention (for a review on 
synchronization in complex networks see [18]). If each 
of the oscillator is described by its individual dynamics 
by <Pi = f{<t>i), the dynamics of the coupled oscillators is 
governed by the following equation 

= f[c^^) + A„-[/i(0,) ~ h{cj>,)] (214) 

3 

where /i is a given output function. The stability of syn- 
chronized solutions is related to the eigenvalues Aq = < 
Ai < • • • < Aat of the graph Laplacian. More precisely, 
Pecora and Carroll [257] showed that the stability is re- 
lated to the ratio -sometimes called 'synchronizability' 
defined as 



and that the smaller e, the more stable the synchronized 
solutions. Donetti et al. [Ill] proposed to construct net- 
works with a fixed number of nodes and average degree 
and which optimize synchronizability (ie. minimizing e). 
The optimal networks are not small-world networks and 
display peaked distributions of degree, betweenness, av- 
erage shortest paths, and of loops. An important feature 
of these networks is that there have a very entangled 
structure with short (£), large loops and no well-defined 
structure. In a recent article [62], Brede extended this 
problem and proposed to minimize both the total amount 
of wire It{G) to connect the network and the synchro- 
nizability characterized by e 

E = A£t(G) + (1 - A)e (216) 

Starting with a one-dimensional chain for A = 0, one ob- 
tains the complete graph which minimizes the synchro- 
nizability. For A = 1, the Id line is optimal and for 
intermediate values of A one obtains the networks shown 
in Fig. 67. Most links are small and the link length I 




FIG. 67 (a-d) Examples of networks constructed for different 
values of A = 0.05, 0.3, 0.8, and 0.95. (e-h) Corresponding 
link length distribution for these different values of A. Straight 
lines in (f-h) represent power laws with exponent 1.05, 1.44, 
and 1.5, respectively. From [62]. 

distribution goes from a peak distribution to a broader 
law. A power law fit of the form P(l) ^ l^" gives a value 
of the exponent in the range [1, 1.5] (for a random graph 
with the same degree distribution and no total length 
constraint, the length distribution is uniform and when 
there is a total length constraint, the distribution is ex- 
ponential). This result means that small- world networks 
can display optimal synchronization but with shortcuts 
covering many different length scales. 

In another paper [63], the same author studied the 
trade-off between enhanced synchronization and total 
length of wire but allowing the nodes to rearrange them- 
selves in space with the constraint that the average Eu- 
clidean distance among nodes is constant. In this model, 
depending on the cost of wire, one observes different or- 
ganization in modules: when the cost of wire A increases 
one observes an organization in a large number of mod- 
ules connected on a ring. 
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5. Beyond trees: noise and loops 

In most examples studied in literature (including the 
ones presented above), the optimal networks are trees. 
However in many natural networks such as veins in leaves 
or insect wings, one observe many loops. Very recently, 
two studies which appeared simultaneously [98, 187] pro- 
posed possible reasons for the existence of a high density 
of loops in many real optimal networks. In particular, it 
seems that the existence of fluctuations is crucial in the 
formation of loops. Maybe more related to the botanic 
evolution of leaves, the resilience to damage also natu- 
rally induces a high density of loops (see Fig. 68 for an 
example of flow re-routing after an injury). 
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FIG. 68 Re-routing of the flow around an injury in a lemon 
leaf. From [187]. 

In these studies, the model is defined on a network 
with conductances Ce on each link and where the total 
power 



k jer{k) 

is minimized under the cost condition 
1 



2E E 

k jer(k) 



1 



(217) 



(218) 



where in this equation it is assumed that the cost of a 
conductance C]^j is given by C^^ where 7 is a real number. 

Following [187], we can introduce two variants of this 
model. The first one which represents the resilience to 
damage is defined as follows. We cut a link e and compute 
for this system the total dissipated power denoted by P'^ . 
The resilience to this damage can then be rephrased as 
the minimization of the functional 



(219) 



therefore one link with infinite resistance in the system 
and the dissipated power P'^ is infinite. The finiteness of 
R implies the existence of loops in the optimal network. 
In another model, [187] Katifori introduces time fluctu- 
ating load by introducing a system with one source at the 
stem of the leaf and one single moving sink at position 
a. For this system, one can compute the total dissipated 
power P° and resilience to fluctuations can be rephrased 
as the minimization of the functional 



F = ^P'' 



(220) 



Brolten bonds 



Moving sink 




Note that if breaking e disconnects the network, there is 



FIG. 69 Optimal networks for the resilience to damage (left) 
and to a fluctuating load (right panels). From [187]. 

In these models, one observes the formation of loops 
(see Fig. 69), reminiscent of the ones seen in real leaves. 
These recent studies shed a new light in the formation 
and the evolution of real- world networks and open inter- 
esting directions of research. In particular, it would be 
very interesting to understand more quantitatively the 
condition of appearance of loops for example. 
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F. Summary: Effect of space on networks 

We presented in this chapter the most important 
classes of models of spatial networks formation. When 
spatial constraints are present, the network structure is 
affected and we summarize the main effects below. 

• Effect of space embedding on degree distribution, 
assortativity, clustering, and average shortest path 

Spatial constraints imply that the tendency to con- 
nect to hubs is limited by the need to use small- 
range links. This explains why the degree distri- 
bution has a cut-off depending on the node density 
and the almost flat behavior observed for the assor- 
tativity. Connection costs also favor the formation 
of cliques between spatially close nodes and thus 
increase the clustering coefficient. The increasing 
tendency to establish connections in the geograph- 
ical neighborhood leads to a large average shortest 
path. If the density of shortcuts is too low, the 
average shortest path can behave as A^^/*^ as in a 
d— dimensional lattice. In the opposite large 
density of shortcuts leads to a logarithmic behavior 
typical of small- world networks. 

• Effect of space embedding on centrality 

Spatial constraints also induce large betweenness 
centrality fluctuations. While hubs are usually very 
central, when space is important central nodes tend 
to get closer to the gravity center of all points. Cor- 
relations between spatial position and centrality 
compete with the usual correlations between degree 
and centrality, leading to the observed large fluctu- 
ations of centrality at fixed degree: many paths go 
through the neighborhood of the barycenter, rein- 
forcing the centrality of less-connected nodes that 
happen to be in the right place; this yields larger 
fluctuations of the betweenness centrality and a 
larger number of 'anomalies'. 

• Effect of spatial embedding on topology-traffic 
correlations 

As we can see with growing network models where 
preferential attachment plays a role, spatial con- 
straints induce strong nonlinear correlations be- 
tween topology and traffic. The reason for this 
behavior is that spatial constraints favor the forma- 
tion of regional hubs and reinforce locally the pref- 
erential attachment, leading for a given degree to a 
larger strength than the one observed without spa- 
tial constraints. Moreover, long-distance links can 
connect only to hubs, which yields a non-linear be- 
havior of the distance-strength with values /3d > 1: 
the existence of constraints such as spatial distance 
selection induces some strong correlations between 



topology (degree) and non-topological quantities 
such as weights or distances. 

• Space and optimal networks 

Spatial constraints combined with efhciency lead 
to a large variety of networks, with the existence of 
hub-and-spoke organization. Also, the emergence 
of complex structure in trafRc organization could 
be explained by an optimization principle. In par- 
ticular, strong correlations between distance and 
traffic arise naturally as a consequence of optimiz- 
ing the average weighted shortest path. In the op- 
timal network, long-range links carry large traffic 
and connect regional hubs dispatching traffic on a 
smaller scale ensuring an efficient global distribu- 
tion. These results suggest that the organization of 
the traffic on complex networks and more generally 
the architecture of weighted networks could in part 
result from an evolutionary process. In the case of 
optimal networks, fluctuations and resilience to at- 
tack lead naturally to the formation of loops which 
shed some light on the evolution of various systems. 



V. PROCESSES ON SPATIAL NETWORKS 

We will focus in this chapter on various processes which 
take place on spatial networks and the guiding idea is to 
focus on the effects of space. In this respect, we will not 
address the general results concerning complex scale-free 
networks as there are already many reviews and books 
on these (see for example [32, 87, 245]) but we will try 
to address specifically effects which can be related to the 
spatial aspects of networks. In particular, we will essen- 
tially focus on studies using the small-world model with 
shortcuts located at random on a d-dimensional underly- 
ing lattice. This model which interpolates nicely between 
a 'pure' spatial lattice and a 'pure' random graph with 
one parameter only, represents an interesting playground 
to test theoretical ideas and models. 

We will discuss six different processes. We will begin 
with the study of Ising model on spatial networks as it 
represents the archetype of phase transition and could 
be used in the modelling of very different systems. We 
will then discuss random walks and synchronization on 
spatial networks, followed by the important problem of 
navigating and searching in a spatial network. We will 
then discuss the robustness and resilience of spatial net- 
works, having in mind more practical examples such as 
the robustness of power grids for example. Finally, we 
will discuss the effect of space on the spread of disease. 



A. Ising model 

It is well known in statistical physics that long-range 
interactions affect the behavior of most systems (see for 
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example [240]) and we expect that long shortcuts in the 
small-world lattice will indeed change the thermodynam- 
ics of the usual Ising model on a regular lattice. In this 
section, we start by recalling the main results on the Ising 
model on lattices and we will then review the results on 
spatial networks such as the Watts-Strogatz small-world, 
and Apollonian networks. We will end this section with 
a general discussion on the effect of a spatial embedding 
on the critical behavior of phase transitions. 



1. Generalities on the Ising model on lattices 

The Ising model is a pillar of statistical physics and 
numerous books and reviews can be found on this sub- 
ject. It was developed and studied in the framework of 
phase transitions, but the simplicity of the Ising model is 
the main reason for its ubiquity and for its various appli- 
cations to many fields, even far from traditional physics. 
Indeed, in a first approach it is tempting for example to 
describe the behavior of an agent by a binary variable. 
It could be the answer 'yes' or 'no' to a certain question 
or the choice between 'A' or 'B' for example. The Ising 
model then appears as a very useful model for describing 
collective phenomena in social systems, such as opinion 
formation and dynamics for example (for applications of 
statistical physics to social systems, we refer the inter- 
ested reader to the review [76] and references therein). 

The Ising model is usually defined on a lattice and 
on each node, there is an Ising spin which is an object 
which can take two values only: cTi = ±1. The interaction 
between spins is described by the Haniiltonian 

where Jij represents the energy gain if two spins are 
aligned. The usual ferromagnetic Ising model on a lat- 
tice is obtained for Jij = J > for neighbors on a lat- 
tice. At zero temperature T — 0, the ground state of 
the Hamiltonian Eq. (221) with ferromagnetic couplings 
is obtained for all spins aligned and the magnetization 
is then M — J^i'^i/^ — Depending on the dimen- 
sion of the lattice, we obtain different collective behaviors 
that we recall here briefly (for the ferromagnetic case). 

• For one-dimensional systems, the ordered ferro- 
magnetic configuration is obtained only for T = 
and there is no phase transition at finite tempera- 
ture. 

• For d > 2, the situation is different: at tempera- 
tures low enough, an ordered state is possible (with 
M — ±1) while at high temperature this order is 
destroyed and M — (for large N). The compe- 
tition between the thermal agitation and the in- 
teraction leads in fact to a non-zero temperature 
transition Tc > (for N oo). 



• The upper critical dimension of the Ising model is 
c?c = 4 which means that above 4, the critical ex- 
ponents are those of the mean-field obtained in the 
infinite dimensional case. 

• In the infinite dimensional case or equivalently in 
the case of a fully connected graph, the behavior 
of the Ising model is completely understood with 
the so-called mean-field theory. This approach is 
appropriate when the number of neighbors is very 
large, in which case fluctuations are negligible and 
only the average number of spins up and down is 
important. 



2. Ising model on small-world networks 

The first studies on the Ising model were done on lat- 
tices and later on planar networks. In particular, the 
critical behavior on a random planar lattice has been elu- 
cidated by Boulatov and Kazakov in an important paper 
[59]. More recently, Ising models were also studied on 
complex networks (see for example [32]) and we will fo- 
cus here on the Watts-Strogatz network where we will 
discuss the effect of long-range links on the nature and 
existence of a transition in the model. 

In the complex network field, very quickly after Watts 
and Strogatz published their paper, Barrat and Weigt 
[28] studied the one-dimensional WS model where at p = 
0, the network is a onc-dimensional ring where each node 
is connected to its 2m nearest neighbors by ferromagnetic 
bonds. In this situation the transition temperature is 
Tc — and the question is how the long-range links will 
modify this one-dimensional result. In [28], the authors 
use the replica approach to study this problem. They 
found that at high temperature, the paramagnetic phase 
is stable for any value of p. For small values of p, an 
expansion in terms of p^o can be performed where ~ 
exp{Jm{m + 1)/T) is the correlation length of the Id 
system, and implies that the first order approximation 
breaks down for pS^o ^ 1. This result means that for 
finite p, there is a transition temperature given by [28] 

~ (222) 
\ogp 

We can understand this result using the following argu- 
ment. In the one-dimensional Ising model, the correla- 
tion length is which is the typical size of domains of 
correlated spins (ie. having the same value). The average 
number of long-range links connected to a domain is then 
of order p^o and it is thus clear that when p^o ^ 1 the 
numerous shortcuts will lead to a mean-field behavior. 

In [155], Herrero studied numerically the Ising model 
on WS networks constructed with lattice of larger di- 
mensions (d = 2, 3). In these networks, the presence 
of disorder, even small, leads to a change of universality 
class going from Ising like at p = to mean-field type 
for p > (as signalled by mean-field values of the critical 
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exponents). The Id argument for the transition temper- 
ature is also stih correct and Tc is such that the spin 
correlation length is of order the typical size of small- 
world clusters N* ^ 1/p (see section IV.C.l) leading to 
the behavior 

T,{p) ^-1/logp (223) 
for d = 1, and for d > 1: 

T,{p) - T,(0) ^ p^/'^" (224) 

Chatterjee and Sen [79] and Chang, Sun, Cai [78] 
studied the Ising model on one-dimensional WS network 
with shortcuts whose length is distributed according to 
P{e) - (see [195] and section 1V.C.2). Accord- 

ing to [79], there is a finite transition temperature for 
< a < 2. Their results seem to indicate that for a < 1 
the behavior is mean-field and that for 1 < a < 2, the 
system has a finite-dimensional behavior. This last result 
is however still under debate as in [78] the numerical re- 
sults seem to indicate that for the whole range < a < 2, 
the behavior is mean-field like. 

The Ising model was also studied on Apollonian net- 
works in [178] where it is shown that for a model with 
decaying interaction constants of the form J„ oc n~" 
(where n denotes the generation number), the critical 
temperature depends on the number of generations n as 
Tc{n) - where T varies continuously from t(0) — 1 

to r(oo) = 1/2 indicating a new kind of critical behavior. 



3. Critical fluctuations 

In [60], Bradde et al. investigate from a general point 
of view the role of the spatial embedding on the critical 
behavior of phase transitions. They considered the Ising 
model as a prototype for studying the complex behavior 
induced by space. The Ising spins are located on the 
nodes of a d— dimensional euclidean space and two nodes 
i and j are connected with a probability p.y which reads 

p,,^e,e,Ji\n-rj\) (225) 

where rj^) denotes the position of node The hidden 
variables 9i are found by fixing some conditions such as 
the average degree for example. 

The important quantity here appears to be the spec- 
trum p(A) associated to the matrix pij (which is similar 
to the adjacency matrix). We denote by A its largest 
eigenvalue, and by Ac the spectral edge equal to the av- 
erage value of the second largest eigenvalue Ac = (A2) 
and such that for A < Ac the spectrum is self-averaging. 
The spectral gap is defined as Ajy = A — Ac- Under the 
assumption that the gap is self-averaging in the thermo- 
dynamic limit and A^r — >■ A, Bradde et al. distinguish 
two possible behaviors according to the value A. If A > 
when T — )■ Tc, the fluctuations are mean-field. If A = 
then the fluctuations are mean-fleld or not depending if 



p(A) is vanishing faster or slower than Ac — A close to the 
edge. 

For homogeneous networks (obtained with 9i — 9), 
Bradde et al. recovered in the exponential case for J(r) 
the classical Ginsburg result stating that for d < 4 crit- 
ical fluctuations are relevant. For a power law behavior 
J(r) ~ r~", the mean- field approximation is exact for 
d < a < 3d/2 and for a > 'id/2, non-trivial exponents 
are expected. For a < d, a non-zero gap appears in the 
spectrum suggesting a mean-field behavior. In addition, 
Bradde et al. studied the case of complex networks em- 
bedded in a d = 2 space with a power-law degree distri- 
bution with exponent 7 and with an exponential function 
for J(r) — exp(— r/do). For small do the effect of space is 
important and we expect non-trivial effects of space with 
a spectrum close to Ac behaving as p(A) ^ (Ac — A)''. 
For instance, they showed that for 7 = 4 the mean-field 
behavior holds while for a larger value 7 = 6, the critical 
fiuctuations are not captured by the mean-field. From a 
more general perspective, these first results seem to indi- 
cate that this approach opens interesting perspectives in 
the understanding of phase transitions in spatial complex 
networks. 



B. Random walks in spatial networks 

Random walks on graphs is an important and old sub- 
ject in both statistical physics and in mathematics. Many 
books can be found on this subject (see for example [315]) 
and in this chapter we will focus on how shortcuts in a 
Watts-Strogatz network will affect the usual lattice be- 
havior. For more results and details on random walks on 
general complex networks, we refer the interested reader 
to the review [54] or to the book [32]. We also note that 
random walks are also studied in connection with spec- 
tral graph theory and we refer the interested reader to 
the book [83]. 



1. Quantifying the effect of shortcuts 

The random walk can easily be defined on a 
d— dimensional lattice, for example in 1/ where the 
walker jumps at every time step to one of its 2d neigh- 
bor with equal probability. We just recall here the most 
important results. 

If we denote by pij the probability that a walker at 
node i will jump to node j, the master equation for the 
probability P(i,t]io,0) that a walker starting at node 
at time t = is visiting the node i at time t is 

dtP(i, ijio, 0) = ^ Aj,pj,P{j, t\ia, 0) - A,,p,jP{i, t\iQ, 0) 
3 

(226) 

where Aij is the adjacency matrix. A usual choice for 
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the transition rate is given by 

P^j - f (227) 

where ki is the degree of node i. This choice corre- 
sponds to the case of uniform jumping probabihties to 
the neighbors of i and where r = J^jPij is the total 
escape rate. With this simple choice, the stationary so- 
lution of Eq. (226) is 

P(z,oo|zo,0) = (228) 



a. Return probability. Many important quantities can be 
calculated with the use of the (modified) Laplacian op- 
erator on the network which can defined as 

Lij = kiSij — Aij (229) 

where ki is the degree of node i. The link between ran- 
dom walks on graph and spectral graph theory is known 
for a long time (see for example [83]). In particular, the 
eigenvalue density p{X) of this operator gives the return 
probability Po{t) (which is the probability that starting 
from the origin the walker returns to it at time t) 

/•DO 

Po(i)= / e-^'p{X)dX (230) 
Jo 

This relation implies in particular that the long time limit 
of Po{t) is controlled by the behavior of p{X) for A — > 0. 

For example, for a d— dimensional lattice, the spectral 
density behaves as [119] 

Pl{X) ^ A-^/^-i (231) 

which gives the classical result 

Poit) - ^ (232) 

This quantity allows us to construct a typology of walks 
for infinite lattice. For finite lattices, the walk is always 
recurrent and for infinite lattice, the nature of the walk 
can be characterized by the behavior of the quantity 

1= [ Po{t)dt (233) 
Jo 

for T — >■ oo. We thus find the usual results, namely 
that for d < 2, the walk is recurrent and for rf > 3 it is 
transient. 

In the case of an ER graph, Rodgers and Bray [267] 
showed that the density of states behaves as 

PerW - e-=/^ (234) 



leading to the following long time limit 

Po^^(0 ^ e-^'"'" (235) 

where a and c are non-universal parameters. In the case 
of the ER random graph the return probability thus de- 
creases faster than the lattice: the random links allow 
the walker to explore other regions of the network and 
make its return more difficult. 

If we now think of a c?-dimensional variant of the 
Watts-Strogatz (see section IV.C.l), we add a fraction 
p of shortcuts to a d— dimensional lattice and the natural 
question is how these shortcuts will affect the diffusion 
process. This problem was studied numerically in [174] 
and an analytical answer was given by Monasson in [235] 
who showed that the eigenvalue density for the WS model 
can be approximately written as 

PwsW - PlW X perW - X'/^-'e-P/^ (236) 

leading to the following behavior for the return probabil- 
ity 

Po(t)-Po(oo)^|'_(^!,^V3 JJJ'l (237) 

where Po(oo) = 1/iV, and where ti is the crossover time. 
In the first regime, at short times, the walker didn't en- 
counter shortcuts and we recover the d-dimensional be- 
havior. After a time t, the volume explored is of order 
V ~ i''/^ and the number of shortcuts in this volume is 
given by pV . The crossover time is thus such that pV ^ 1 
leading to 

h - l/p2/'i (238) 

After this time, the walkers 'feel' the small-world nature 
of the network and we recover the ER behavior given by 
a stretched exponential. 



b. Number of distinct nodes, mean-square displacement and 
first-time return probability. In [13], the authors study 
the effect of shortcuts on different quantities such as the 
number N^ov of distinct sites visited at time t (which 
was also studied in [172]), the mean-square displacement 
{r'^{t)) and the first-return time distribution. 

In the case of d-dimensional lattices and with a tran- 
sition probability pij = 1/ki, Dvoretzky and Erdos [117] 
studied the average number of distinct sites and found 
the following behavior 

{Vt d=l 
t/\nt d^ 2 (239) 
t d>3 

In the case of a one-dimensional WS network with a 
probability p to have a shortcut and an average num- 
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ber of shortcuts given by a; = pN , these behaviors will 
be modified and simple scaling arguments can help us 
to understand the effect of the shortcuts [13]. In the 
WS network, the typical size of a cluster is ^ ~ 1/p 
(see section IV.C.l) and for a time t ^ the random 
walker doesn't encounter any shortcuts and we expect 
a lattice-like behavior with N^ov ~ V^- For very large 
times t ^ 7V^, the walker has visited the whole net- 
work and we have N^ov ~ N . For intermediate times 
^ t ^ iV^, the walker spends on average a time 
ti ^ before encountering a shortcut. At time t the 
number of shortcuts and thus of different segments vis- 
ited is t/ti ~ i/^^ and the total number of distinct visited 
sites is then N^ov ^ {t/i'^) x ^'^ ^ t [13, 172]. We can 
summarize these behaviors in the following scaling form 

N,,,^NSi^,x) (240) 

with 

{yy/x, y<l 
y/x, K y < X (241) 
1, y > a; 

This scaling is confirmed by the excellent collapse ob- 
served numerically (see Fig. 70). 

A similar line of reasoning can be applied to the mean- 
square displacement {r'^{t)) where the distance is here 
computed in terms of hops from the origin (ie. the length 
of the shortest path from the origin to the node where the 
walker is located). On a d-dimensional lattice the mean- 
square displacement behaves as (r^(t)) ^ t, a behavior 
that we expect on the WS networks for times such that 
i <C 5^. For large times, the mean-squared displacement 
saturates to a value equal to the diameter squared (we 
recall here that the diameter is defined as the maximum 
of all distances between pairs and is well approximated 
by the average shortest path {£) ) and the saturation thus 
reads (r^(t)) — )■ These behaviors can be encoded in 
the following scaling form [13] 

{r\t)) = {irR{^,x) (242) 

with 

which is confirmed numerically (see Fig. 70). 

2. Diffusion on power law Watts-Strogatz networks 

In [198], Kozma, Hastings and Korniss study diffusion 
in a dimensional Watts-Strogatz networks of linear 
size L where the shortcuts are connecting two nodes i 
an j (located at r and r', respectively) with a probability 




FIG. 70 (a) Rescaled number of distinct sites visited at time t 
versus the rescaled time. The symbols correspond to different 
values of x (circles: x — 10, squares: x — 10^, stars: x = 
10'') and for each value of x two values of p are used (0.0002 
and 0.01). The collapse is very good making it difficult to 
identify the two curves for each x. (b) Rescaled mean-square 
displacement versus rescaled time showing a perfect collapse 
(for each x there are two curves for two different values of p) . 
From [13]. 



decaying as p[r — r'[~". The exponent a thus character- 
izes the length of shortcuts and the p is essentially the 
probability of a site to have a shortcut. Analytical cal- 
culations are simpler in the case (which is believed not 
to modify the universality) where multiple links are al- 
lowed with a Poisson distribution of links with average 
p/Af\r — r'[" where A/" is a normalization constant. 

The equation studied in [198] and which describes the 
random walk process is 

dtPir, t) = - ^(A„, + gA-"'^)P(r', t) (244) 

r' 

where P(r, t) is the probability to find the walker at site 
r at time i, — A^,-' is the discretized Laplace (diffusion) 
operator. For example for d — \, one has —AijAIj ~ 
M,+i - M, - (M, - M,_i) giving 

Ay = 2(5ij - ~ (245) 
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The quantity —A™,"'' is the diffusion operator for the 
random hnks with diffusion coefficient q. On the d = \ 
iattice, one has A™"'^ = 5ij ^i^i Ju — Jij where Jij is 
the number of finks connecting the nodes i and j. 

The Green's function G{r,r',t) for this modef is the 
solution of the equation (244) with the initiaf condition 
G{r,r',t = 0) = S{r — r'). The Fourier transformed 
Green's function in the Umit cj — > 

G[r,r') = fim(iw + A + 9A™"'*)-i(r, r') (246) 

averaged over disorder gives a translation invariant quan- 
tity depending on |r — r'| only. For r = r', we obtain 
(G(0)) which is related to the return probability (and is 
in fact the quantity / defined in Eq. (233)) and its scaling 
with the size L — >■ oo determines if the random walk is 
transient or recurrent. 

The authors of [198] used a self-consistent perturbation 
theory and obtained the phase diagram for all dimensions 
and values of a (a detailed version of this work can be 
found in [199]). The results in the one-dimensional case 
can be summarized in the figure (71) where a comparison 
is made with an annealed approximation where A''""'* is 
replaced by its average. In particular, for d — 1^ these re- 
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FIG. 71 Phase diagram for d — 1. The annealed approxima- 
tion shows some differences with the self-consistent approxi- 
mation for the quenched case. In the quenched case, the waik 
is transient for q < 2 and recurrent for a > 2. From [198]. 

suits are in agreement with the anaiysis presented in [173] 
which predicted a = 2 as a crossover between a recurrent 
and transient regime. In other words for a > 2 the short- 
cuts do not affect the lattice behavior whife when a < 2, 
there are fong shortcuts feading to entireiy different re- 
gions of the networks. They modify quaiitatively the iat- 
tice behavior and make it look like a large d-dimensional 
lattice, in agreement with other processes such as spin 
models where the mean-field behavior is recovered for 
p ^ (in the case a = 0. 



C. Synchronization 

Many natural phenomena can be described as a set of 
coupled oscillators. This is the case for cardiac cells, fire- 
flies, etc. The synchronization properties depend on the 
coupling pattern and many studies were devoted to the 
effect of the coupling network structure (see the review 
[18]). 

Each oscillator i can be described by a scalar degree of 
freedom 0; and each unit evolves according to an internal 
dynamics governed by (f>i = f{4>i). If these oscillators 
are linearly coupled by a network, the output is a linear 
superposition of of the outputs of the neighboring units 
and the evolution equations take the form of 

^^fi^^) + cTj2aM^,) (247) 

i 

where Cy is the coupling matrix, a is the interaction 
strength and h a fixed output function. An important 
case corresponds to Cij — Lij where L is the graph 
Laplacian. In an important paper, Pecora and Carroll 
[257] showed that the stability of synchronized solutions 
is related to the ratio 



and that the smaller e, the more stable the synchronized 
solutions. 

After Watts and Strogatz published their paper [313] 
synchronization properties were studied on SW networks 
which showed that these networks were both able to sus- 
tain fast response due to the long-range links and both 
coherent oscillations thanks to the large clustering [204] . 
In particular, the ratio e was determined for SW net- 
works [23] where additional edges are randomly wired on 
a one-dimensional ring. They obtained the general result 
that the small- world topology exhibits better synchroniz- 
ability properties than ordered rings and that it can be 
achieved with a small density of shortcuts [163]. 

Finally, let us mention the work [218] which presents a 
study of coherence in various networks, including Apollo- 
nian networks (see [178] and section IV. A. 4). The main 
result is that for the Apollonian network, the coupling 
threshold beyond which coherence disappears decreases 
when the heterogeneity is increased. Also, the transition 
to coherence in this case is shown to be of the first order 
[218]. 



D. Navigating and searching 

1. Searchable networks 

The original 1967 experiment of Milgram [230] show- 
ing that the average shortest path in North- America is 
around 6 raises a number of questions. The first one is 
about the structure of the social network and it is now rel- 
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atively clear that enough shortcuts will modify the scal- 
ing of {tj and induce a logarithmic behavior. Another 
question raised by Kleinberg [195] is actually how a node 
can find a target efficiently with only a local knowledge 
of the network (the answer being trivial if you know the 
whole network). It thus seems that in some way the social 
network is search-efficient- or is a searchable or naviga- 
ble network- meaning that the shortcuts are easy to find, 
even by having access to local information only. In these 
cases, one speaks of navigability or searchability when 
the greedy search is efficient. 

This problem goes beyond social networks as decentral- 
ized searches, where nodes only possess local information 
(such as the degree or the location of their neighbors for 
example) in complex networks have many applications 
ranging from sensor data in wireless sensor networks, lo- 
cating data files in peer-to-peer networks, finding infor- 
mation in distributed databases (see for examples [299] 
and references therein). It is thus important to under- 
stand the efficiency of local search routines and the ef- 
fect of the network structure on such decentralized algo- 
rithms. 

In the case of social networks, it seems that they are 
composed by a spatial part constituted of the neighbors 
of a node belonging to its spatial neighborhood (such 
as in a regular lattice) and a purely social component, 
not correlated with space and which can connect regions 
which are geographically very far apart. In a search pro- 
cess, it is thus natural to try these links which open the 
way to very different parts of the world (in Milgram's 
experiment it is indeed interesting to note that individ- 
uals were passing the message only according geogra- 
phy or proximity in the space of professional activities 
[191], as it was known to them that the target individ- 
ual was a lawyer). In order to quantify this, Kleinberg 
[195] constructed a d-dimensional Watts-Strogatz model 
where each node i of the lattice 

• (i) is connected to all neighbors such that their lat- 
tice distance is less than p (with p> 1). 

• (ii) has q shortcuts to node j with a probability 
decreasing with the distance 



p{i j) - dE{i,j) ' 
where a is a tunable parameter. 



(249) 



The search process is the following one: a message 
needs to be sent to a target node t whose geographical 
position is known and a node i which receives the mes- 
sage forwards it to one of its neighbor j that is the closest 
(geographically) to t. This is the simplest decentralized 
algorithm that we can construct (and which requires only 
geographical information). The most important figure of 
merit for this type of algorithm is the delivery time T 
(or its average T which is easier to estimate analytically) 
and its scaling with the number of nodes N. Kleinberg 
found bounds on the exponent of T (see Fig. 72) and 
the important result is that the delivery time is optimal 
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FIG. 72 Lower bounds of the exponent governing the be- 
havior of the average delivery time T as a function of the 
exponent a controlling the distribution of shortcuts. After 
[195]. 



for a = d for which it scales as log N while for a ^ d, 
it scales faster (as a power of N). This behavior can 
be intuitively understood: for a > d, long links are rare 
and the network looks essentially as a lattice (with renor- 
malized spacing). In the opposite case a < d, shortcuts 
are all long and not necessarily useful. The best case 
is obtained when the shortcuts explore all spatial scales, 
which is obtained for a = d (This result was extended in 
[266] to the case of small-world network constructed by 
adding shortcuts to a fractal set of dimension df). 



2. Sketch of Kleinberg's proof 

Inspired from Kleinberg's original rigorous derivation 
[194] of the bounds shown in Fig. 72, we can give the 
following hand-waving arguments in order to grasp some 
intuition on the effects of the link distribution on the av- 
erage number of steps to reach a target in a decentralized 
algorithm (we will discuss in detail here for the case for 
d = 2 but the extension to a generic d is trivial). For 
the interested reader, we also note that a detailed study 
of the d = 1 case is done in [335], that exact asymptotic 
results were obtained in [74], and that the 'greedy' paths 
connecting a source to the target were studied in [295] 
by defining a greedy connectivity. 

For a = 2 (and we assume here that p — q — 1), the 
probability to jump from node u to node v is given by 



P{u v) = 



1 



1 



Z dsiu, v) 



(250) 



where Z is the normalization constant given by 
1 



Z = 



V 

^ dB(u,w)2 
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(251) 
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implying that P{u v) ^ 1/ In NdE{u,v)'^ (here and 
in the following we will use continuous approximation 
and neglect irrelevant prefactors-for rigorous bounds we 
refer to [194]). Following Kleinberg [194], we say that the 
execution of the algorithm is in phase j when the lattice 
distance d from the current node (which is holding the 
message) is such that 2^ < d < 2^+^. The largest phase 
is then In N and the smallest when the message reaches 
the target node. The goal at this point is to compute the 
average number of steps T to reach the target. For this 
we decompose the problem in computing the average time 
duration Tj that the message stays in phase j. For this we 
have to compute the probability that the message leaves 
the phase j and jumps in the domain Bj defined as the set 
of nodes within a distance 2^ to the target node t (see Fig. 
73). The size of this set Bj is \Bj \ ~ 2^-' and the distance 




FIG. 73 Line: jump out of the phase j into the domain Bj. 

between u and any node of Bj is d{u,v G Bj) < 2-'"'"^. 
The probability to get out of phase j by using a long- 
range link is thus 



Pout 



\B, 



1 



(2^+2)2 In InN 
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(the actual exact bound found by Kleinberg is Pout > 
1/(136 InA^). We then have P{Tj ^ i) = [1 - PoutYPout 
from which we obtain Tj ~ In A*". The average time to 
reach the target is then 



In Af 
.7=0 



(253) 



which is the minimum time obtained for a decentral- 
ized algorithm for a — 2 (and a — d for the general 
d-dimensional case). 

We now consider the minimum scaling of T in the case 
a <2 (and general p and q) . The normalization constant 



behaves then as 



.N/2 

Z = 22 dsiu, v)-"" ~2tt I r'°'rdr 



2~a 



(254) 



We assume now that the minimum number of steps to 
reach the target scales as . In this case, there is nec- 
essarily a last step along a long-range link leading to a 
node which is different from the target node and which 
is in the region U centered at t and of size ~ pN^ (Fig. 
74). The probability Pi that this long-range link leading 



Tr^N^ Steps 




source node s 



targot nodG t 



dr^pN' 



FIG. 74 One dimensional representation of Kleinberg's the- 
orem in the case a < 2. 



to U at step i is given by 



(255) 



and the probability that it happens at any step less than 
is 



3<5-2-|-a 



(256) 



This probability is non-zero only if 35 — 2 + a > leading 
to the minimum possible value for 5 such that T ^ 



2- a 



(257) 



(the d-dimensional generalization would give S„iin = {d— 
a)/{d+l)). 

In the last case a > 2, we will have mostly short links 
and the probability to have a link larger than m is given 
by 



P{dE{u, v) > m) 



^rdr 



(258) 



In the following we will use the notation e = a — 2. The 



67 



probability to have a jump larger than N'^ for T < 
is then given by 



This probability will be non-zero for 

/3 - 76 > 



(259) 



(260) 



Also, if at every step during a time T ^ , we perform a 
jump of size N'^ the traveled distance must be of order N 
which implies that N^N'^ ^ N leading to the condition 
/3 + 7 = 1. This last condition together with Eq. (260) 
leads to the minimum value of /3 



(3. 



1 



(261) 



which can be easily generalized to {a — d)/{a ~ d + 1) 
in d-dimensions. We thus recover the bounds 6min and 
Pmin shown in the Fig. 72. 



3. Searching in spatial scale-free networks 

Kleinberg thus showed that a simple greedy search 
passing the message to the neighbor which is the nearest 
to the target is then able to find its target in a logarith- 
mic time for a lattice with a = d. The lattice considered 
in Kleinberg's paper is a variant of the WS model and 
has a low degree heterogeneity. When a large degree het- 
erogeneity is present it is not clear that the greedy search 
used in [195] will work well, as it might be best to jump 
to a hub even if there is neighbor closer to the target 
node. In order to understand the effect of heterogene- 
ity in spatial networks, Thadakamalla et al. [299] stud- 
ied decentralized searches in a family of spatial scale-free 
network where the nodes are located in a d-dimcnsional 
space: 

• With probability 1 — p a new node n is added and 
is connected to an existing node i with a prefer- 
ential attachment probability weighted by the dis- 
tance (see section IV. D) 



oc k,F{dE{i,j)) 



(262) 



where F{d) is a decreasing function of distance (and 
can be chosen as a power law c?~°' or as an expo- 
nential exp(— (i/do)- 

• With probability p, a new edge is connecting exist- 
ing nodes with probability 



Ui^j (X kikjF{dE{i,j)) 



(263) 



The authors of [299] investigated the following search 
algorithms which cover a broad spectrum of possibilities: 



1. Random walk: The message goes from a node to 
one of its randomly chosen neighbor. 

2. High-degree search: The node passes the message 
to the neighbor which has the largest degree. This 
algorithm is already very efficient for non-spatial 
network [2]. 

3. Greedy search: This is the algorithm used in Klein- 
berg's study [195] and where the node i passes the 
message to the neighbor which is the closest to the 
target (ie. with the smallest c?£;(z,t)). 

4. Algorithms 4-8: The node passes the message to the 
neighbor which minimizes a function i^[fci, d£;(z, t)] 
which depends both on the degree of the node and 
its distance to the target. The function F con- 
sidered here are: (i) Fi[k,d] = d — f{k) where 
/(fc) is the expected maximum length of an edge 
from a node with degree fc; (ii) F2[k,d] cx d^] 
(iii) F^[k,d] = d/k; (iv) F4k,d\ cx (v) 
Fr,[k,d]=d/{\nk + l). 

The main result obtained in [299] is that algorithms 
(4 — 8) perform very well and are able to find a path be- 
tween the source and the target whose length is at most 
one hop more than the average shortest path. This result 
is surprising: the calculation of the shortest average path 
requires the knowledge of the whole network, while the 
algorithms used here have only local information. This 
success can be attributed to the fact that the scale-free 
networks considered in this study have hubs which allows 
to find efficiently the target. It should also be noted that 
the greedy search performance is not too bad but with 
the severe drawback that in some cases it doesn't find the 
target and stays stuck in a loop, which never happens 
with algorithms (4 — 8). Similar results were obtained 
for different values of p and a. These results allow the 
authors to claim that the class of spatial networks con- 
sidered here belong to the class of searchable networks. 
The authors checked with these different algorithms that 
it is also the case for the US airline network. 

Finally, we mention Hajra and Sen [151] who stud- 
ied the effect of the transition scale- free/homogeneous 
network ([221] and section IV.D.2.b) on the navigability 
for three different search algorithms. In particular, they 
showed that the effect of the transition on navigability 
is marginal and is the most pronounced on the highest 
degree-based search strategy which is less efficient in the 
power-law regime. 



4. Navigability and metric space 

In [57], Boguna, Krioukov, and Claffy, studied the 
navigability on a network constructed on a hidden met- 
ric space (see section IV. A. 3) where the probability to 
connect two nodes (with degree k and k' drawn from 
P{k) k~'^) lying for example on a one-dimensional 
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space is given by 



r{d-k,k') 



dc{k,k') 



(264) 



where a > 1 controls the clustering (the larger a and the 
larger the clustering coefhcient) and where dc{k,k') ^ 
kk' (other forms are possible). Boguiia, Krioukov and 
Claffy analyzed the result of a greedy algorithm on these 
networks with different values of a and 7 and found that 
the average shortest path decreases with smaller 7 and 
larger a. A large clustering and degree homogeneity thus 
favors the greedy algorithm, a somewhat expected result, 
since hubs allow a fast routing and large clustering an 
efficient local search. Strong clustering also favors the 
fraction of successful paths (ie. finding the target). If 
7 is too large (larger than 2.6) the percentage of suc- 
cessful paths becomes too small which is related to the 
small number of hubs. These results can be summarized 
in Fig. 75(d) where real- world networks are shown in the 
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FIG. 75 (a,b) Success probability ps versus A'^ for various 
values of 7 and for (a) weak clustering and (b) strong clus- 
tering, (c) Success probability ps versus 7 for various values 
of a and for fixed system size A'^ « 10^. (d) The solid line 
separates the navigable from the non-navigable region. From 
[57]. 

clustering-degree distribution exponent plane (C, 7) . The 
solid lines indicate the separation between a navigable re- 
gion and a non-navigable one where the efficiency of the 
greedy algorithm decreases with the system size. Inter- 
estingly enough, we see that all the real-world examples 
of communication and transportation, social and biologi- 
cal networks considered here are in the navigable region. 

These various results (summarized in [201]) show that 
many real-world networks are navigable which means 
that they support efficient communication without a 
global knowledge of the network. From a more theo- 
retical perspective, the authors of [201] suggest the in- 
teresting idea that real-world networks have underlying 
hidden metric space which could be at the source of their 



navigability. 

In the case of the Internet it is however important to 
note that the navigability is not due to the knowledge 
of geography alone. Indeed, the Internet can be mapped 
to a hyperbolic space [58] but the distance in this hyper- 
bolic space encodes multiple factors (such as geography 
and also political, economical, communities, etc.) so that 
navigability results from a combination of all these fac- 
tors. 



5. Effect of cost 

In [215], Li et al. study the navigability on a Watts- 
Strogatz network constructed on a two-dimensional 
square lattice where the pairs of nodes are connected 
with a probability p ~ 1/d". The important difference 
with the case considered by Kleinberg [195] is that there 
is cost associated to the long-range shortcuts and that 
their number is limited by a total value ^j^- dsi^j) < A 
where A represents the amount of resources available for 
the shortcuts. Li et al. study in particular the average 
delivery time and found numerically that it has a mini- 
mum value for a ~ 3 (and a — 2 for d = 1) suggesting 
that the optimal value might be a = d -I- I in contrast 
with Kleinberg's result a ^ d. In addition, the optimal 
delivery time seems to behave as N^^'^ in sharp contrast 
with the behavior (log A^)^ obtained by Kleinberg. These 
result suggest that cost constraints are relevant for the 
navigability condition and it would be interesting to ex- 
tend Kleinberg's result in this case and to understand 
the main mechanisms and how Kleinberg's derivation is 
affected by the cost constraint. 



6. Routing in social networks 

In the study [217], the authors tested the navigabil- 
ity on a social network of bloggers and showed that geo- 
graphic information is sufficient to perform global routing 
(as in Kleinberg's case on a small- world network [195]) 
in a non-negligible fraction of cases: for 13% of the pairs 
source-target, the message reaches the target based on 
geographic information only. On the other hand, these 
authors also showed that the probability that two nodes 
separated by a distance d decays as P{d) ^ 1/d" with 
a « 1 (see section III.B.4). We are thus facing an ap- 
parent contradiction here: for a two-dimensional space 
(which is a good approximation of the US), the naviga- 
bility is obtained for a = 2 according to Kleinberg's result 
and not for a = 1 (see previous section). The authors 
of [217] suggest in fact that the Kleinberg navigability 
condition can be generalized for different networks and 
is obtained when the probability P{u, v) that two nodes 
are connected scales as 



P(u, v) 



1 



R{u, v) 



(265) 
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where R{u, v) is the 'rank' of v with respect to u and 
which counts the number of individuals living between 
u and V. This result from the argument that distance 
alone has no meaning and that one should include the 
density. Indeed, two individuals separated by a distance 
of 500 meters in rural areas probably know each other, 
which is certainly not the case in a dense urban area. If 
the two-dimensional density p is uniform, the rank then 
scales with distance d as 



which size diverges at Pc (which is Pc = 1/2 for a 2d 
square lattice and behaves as Pc ~ 1 /^d for large dimen- 
sions d). At the threshold there is a giant component, or 
infinite percolating cluster which has a universal fractal 
dimension df independent from the lattice. In addition, 
in the vicinity of Pc, the correlation length ^ which mea- 
sures the linear size of finite clusters and the probability 
Poo for a node to belong to the infinite percolating cluster 
scale as 



(266) 



which leads to P{d) ^ XjtP and the navigability condi- 
tion a = d is recovered. The navigability condition of 
Kleinberg thus assumes implicitly a uniform density and 
could be modified for other distributions. Indeed, the au- 
thors of [217] showed that networks based on friendship 
condition of Eq. (265) are indeed navigable and contain 
discoverable shortcuts. The apparent paradox is thus 
solved: for the social network studied in [217], we ob- 
serve R ^ d which explains P{d) ~ l/d and the network 
is still navigable. 



E. Effect of space on robustness and resilience 

Many important infrastructures in our modern soci- 
eties are structured in the form of networks and it is nat- 
ural to question their robustness. Large-scale collapses 
such as blackouts or Internet outages are spectacular il- 
lustrations of this problem. Indeed, very quickly after 
the first publications on small-worlds and scale-free net- 
works, the first studies on robustness appeared [88]. In 
particular, an important distinction appeared between 
robustness in case of random failures and in case of tar- 
geted attacks. In particular, one of the most important 
results [88] states that scale-free networks are very re- 
silient to random failures but very sensitive to targeted 
attacks on hubs for example. 

We will review in this chapter, essentially two types 
of studies which considered the effect of space on the re- 
silience of networks. The first type concerns percolation 
studies which analyze the effect of the underlying lattice 
on the breaking process. Other studies considered the ef- 
fects of additional shortcuts on lattice with the study of 
Watts-Strogatz networks. Another set of studies focused 
on the vulnerability of power grids and on the failure 
cascade process. 



1. Percolation and small-worlds 

The topological effect of the removal of bonds or nodes 
can be understood in the framework of percolation (see 
for example the books [69, 291]). In the case of regular 
lattices and bond percolation with a probability p that 
a bond is present, we observe a percolation transition 
at a finite, non-universal value pc which depends on the 
lattice. Below the threshold, we observe finite clusters 



i-\Pc-pr 

-Poo ^ (P - Pc)f3 



(267) 
(268) 



where the exponent are in the two dimensional case /32 = 
5/36, and 1/2 — 4/3 and in the mean-field case (ie. for 
d > 6) Pmf = 1 and vmf — 1/2. 

The natural question to ask at this point is then the 
effect of shortcuts on this standard percolation behav- 
ior. In [72, 236], the authors use a generating function 
formalism in order to compute various quantities and we 
briefly recall this derivation for the one-dimensional WS 
network where each site is connected to its k*^ nearest 
neighbors and where additional shortcuts are added be- 
tween randomly chosen pairs of sites with probability 
giving an average of (l)kN shortcuts in total. The first 
quantity which is needed is the generating function 



H{z) = Y,P{n), 



(269) 
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where P(n) denotes the probability that a randomly cho- 
sen node belongs to a cluster of n sites other than the 
giant percolating cluster. In other words, below the tran- 
sition H{1) = 1 and above the percolation threshold we 
have P(l) = 1 — Poo- The quantity Po{n) which is the 
probability that a randomly chosen node belongs to a 
cluster of n sites on the underlying lattice is given by 
(for the one-dimensional case and for n > 0) 



Po{n) 



npq''-\l 



(270) 



where g=l — (1— p)*^. We now define the probability 
P(m|n) that there are exactly m shortcuts emerging from 
a cluster of size n and which is given by 



P(m|n) 



N 



1-^ 
N 



2<pkN-m 



(271) 



which indeed represents the number of possible ways to 
choose m end shortcuts with uniform probability n/N 
within a total of 2(j)kN. 

If we assume that there are no loops involving short- 
cuts, we can now write a recursive equation on H(z) by 
noticing that a finite cluster consists of a local cluster of 
n sites and with m shortcuts leading to other clusters 



H{z) = J2 Po{n)z" J2 PMn)[H{z)] 



(272) 
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(this equation can be understood if we note that the pref- 
actor of is the probabihty to belong to a finite cluster 
of size I). For large N, we then obtain 



H{z) = Hoize^'^''^"'^'^-^^) (273) 



where 



(1 - q)^ 

Ho{z) = l-p + pz^^--^ (274) 

From this equation (273), we can then estimate various 
quantities such as the average cluster size 



^(1 + 9) 



I - q - 2k(l)p{l + q) 



(275) 



At the percolation threshold p — Pc, this size diverges and 
we thus obtain an implicit equation for the threshold 



(1-Pc)* 



2fcpc(2-(l-pc 



(276) 



The numerical solution for the percolation threshold ver- 
sus the density of shortcuts (f> is shown in Fig. 76. Close 
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FIG. 76 Numerical solution for the percolation threshold for 
— 10® for site and bond percolation (shown left and right, 
respectively). Circles represent the case k — 1, squares k — 2, 
and triangles k = 5. From [236]. 

to Pc we also obtain (s) ^ {pc — pY^'^ with a — 1, 
P{n) - n-^e-"/"* with r = 3/2. 

These values r = 3/2 and a = 1 correspond actu- 
ally to the mean-field values of percolation. This result 
means that the shortcuts not only modify the percolation 
threshold (which is expected since pc is not universal) but 
also the universality class: the WS model resembles more 
to a random graph in infinite dimension. This result is 
actually consistent with what happens for example for 
the Ising model, which is mean-field like as long as the 
shortcut density is non-zero (see section V.A). 

This one-dimensional model was extended to the d = 2 



case with the help of high-order series expansion [249] and 
Ozana [254] discussed finite-size scaling for this problem 
by noting that we have two length scales in the problem: 
the length ^sw ~ 1/p^^'^ which gives the typical size 
of clusters connected by the shortcuts and the length 
^ which is the usual cluster size for the percolation on 
the underlying lattice. The main result is that short- 
cuts indeed lead to a mean-field behavior and the larger 
their density, the smaller the percolation threshold (see 
Fig. 77). 
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FIG. 77 Percolation threshold for a two-dimensional Watts- 
Strogatz network with a fraction iji of shortcuts (the lines 
are the analytical calculations and the circles represent the 
numerical simulation). The inset is the zoom in loglog on the 
small (j) region (the numerical simulation is compared with the 
analytical calculation for the quantity 1/2 — Pc). From [249]. 

In [310] the authors study the percolation on a model of 
lattice-based scale-free network where each node is con- 
nected to all the neighbors in a radius R distributed 
according to a law P{R) ^ R^^ which implies that 
P{k) ~ fc-T with /3 = (i(7 - 1) + 1. The main rcsuh 
is that the percolation threshold is non-zero for 7 > 2 in 
contrast with scale-free networks with 7 < 3 which dis- 
play the behavior (see for example [32]) Pc{N) — )■ 0. In 
fact, for 7 > 2, the radius distribution is behaving with 
an exponent P > d-\-l and most of the links are short. In 
other words, the behavior should be the one of an almost 
regular lattice (with a rescaled lattice spacing of order 
(i?) < 00) leading thus to a finite percolation threshold. 
We note here that it would be interesting to extend to 
the length for example the study made in [237] where 
the probability that an edge depends on the degree of 
its endpoints, allowing an interpolation between random 
failures and targeted attacks. 

In [20], Auto et al. studied percolation on Apollonian 
networks (see [178] and section IV. A. 4) using real-space 
renormalization. For this two-dimensional spatial, scale- 
free, and planar network, the percolation threshold goes 
to zero in the thermodynamic limit in agreement with 
general results for scale-free networks with 7 < 3. The 
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mass of the percolating cluster however behaves as M '-^ 
e~^lp (where A is a constant), a result reminiscent of the 
marginal case 7 = 3. 

2. Cascade of failures in infrastructures 

A small local failure will lead to a major breakdown 
when it can propagate and reinforce itself. A simple il- 
lustration of this phenomenon is displayed in the exam- 
ple of the random fuse network introduced in 1985 by 
de Arcangelis, Redner and Herrmann [106]. The random 
fuse network is an electrical metaphor of material break- 
down and consists of a lattice where the bonds are fuses 
with threshold ic distributed according to a given law 
P{ic)- If the current going through a fuse is larger than 
its threshold, the fuse breaks and its conductance goes 
to zero. The randomness of the thresholds models the 
heterogeneity of a material and the main question here 
is what happens if a macroscopic current / is injected 
in this system. If we assume that a given bond fails, 
the current will be redistributed according to Kirchoff 's 
laws on the neighbors of the failed fuse. For example, for 
an infinite two-dimensional square lattice, the current on 
the nearest neighbors of the failed bond is multiplied by 
C = 4/7r (see for example [86]). If the distribution P{ic) 
has a finite support [«m,«M], we can then produce the 
following argument. In the worst case, the failed bond 
had the smallest threshold possible im- The current on 
the neighbors is then C,im and the failure will certainly 
propagate if 

C«m > Uf (277) 

This result implies that if the disorder is weak (ie. the 
difference 5 = in — im is small), the failure will very 
likely propagate across the system and create a macro- 
scopic avalanche (which in material science is known as 
a 'brittle' behavior). In the opposite case of large 6, we 
will observe the formation of microcracks which will grow 
and eventually coalesce when the applied current / is in- 
creased ('ductile' behavior). This simple example shows 
that the flow redistribution process after a failure is a 
crucial element, and combined with the heterogeneity of 
the system can lead to a large variety of behavior. A 
general model of failure and current redistribution was 
for example discussed in [238] but we will focus in this 
chapter on cases where space plays a dominant role. 

Various studies examined the vulnerability of infras- 
tructures such as power grids [9, 192, 288], Internet, or 
transportation networks [105, 210, 321] and most of these 
studies are not concerned with spatial aspects but rather 
focused on how to measure the damage and the effect of 
various attack strategies. In particular, nodes with large 
betweenness centrality seem to be the weak points for all 
these networks, leading to the idea that global proper- 
ties of the network are needed in order to understand the 
stability of these systems. 

As many spatial networks with strong physical con- 



straints, power grids have an exponentially distributed 
degree (see section III.B.2) and in this case we can com- 
pute the effect of different removal strategies. In partic- 
ular, if we remove nodes at random, the critical fraction 
fc is given by [88] 

/c = 1 - (278) 

K — 1 

where k — {k'^)o/{k)o (where the subscript indicates 
that we compute the corresponding quantities before any 
node removal). Another strategy consist in attacking the 
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FIG. 78 Effect of random removal of nodes (white circles) 
and targeted attacks (black circles) on the relative size of the 
largest component of the EU power grid. In the inset, the 
effect of random removal and targeted attacks are shown for 
Italy (dashed lines) and France (continuous line). From [288]. 

most connected nodes in which case we can also com- 
pute a critical fraction using the argument developed 
in [88] (see Fig. (78) where the two strategies are illus- 
trated in the case of the European power grid). One 
obtains for a network with a degree distribution of the 
form P(fc) ^ exp(— fc/7), the following condition for the 
critical fraction fc 

l + /c(ln/c-l) = ^:^ (279) 

In the Fig. 79(a), we see that the critical fraction of node 
removal is obviously smaller in the targeted attack strat- 
egy. In the Fig. 79(b), the authors of [288] compare the 
theoretical mean-field value of fc with actual simulation 
results. The agreement is relatively good but there are 
some deviations for small values of 7: the simulated val- 
ues of fc seem to be relatively constant (of order 0.2±0.1) 
while the mean-field value goes to zero for 7 — >■ 1 (which 
corresponds roughly to the one-dimensional case). 

3. Failure of interdependent networks 

Space leads naturally to an increased level of inter- 
connection between critical infrastructure networks. Re- 
cent events such as the 2003 blackout have reinforced the 
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FIG. 79 (a) Comparison of the critical fraction needed to 
disrupt an exponential network with distribution P{k) ~ 
exp(— fc/7). (b) Estimated values for attacks on 33 different 
European power grids (circles) and the mean-field prediction 
of Eq. (279) (continuous line). From [288]. 



need for understanding the vulnerabilities of this spa- 
tially coupled network. In particular, in [67] the authors 
modeled the effect of the coupling between the electri- 
cal network and the Internet. They considered that the 
electrical blackout in Italy in September 2003 caused the 
shutdown of power stations which led to the failure of 
Internet nodes, and which in turn caused the breakdown 
of other power stations. The model proposed in [67] is 
made of two interconnected networks (see Fig. 80). The 
two interconnected networks have their own dynamics 
and can therefore not be considered as forming one sin- 
gle large network. The ability of a node in A to function 
will depend on its B-node (and vice- versa) which leads to 
the following process proposed in [67]: all B-nodes con- 
nected to A-nodes which do not belong the giant cluster 
in A have to be disrupted and vice-versa. This dynam- 
ics leads to surprising effects such the lowering of critical 
thresholds, and the appearance of an abrupt first-order 
transition with the size of the giant component going 
abruptly to zero at the transition point. In particular, 
for two connected scale- free networks, the interdependent 
dynamics leads to an additional fragility: while the de- 
gree heterogeneity is an asset against failure with many 
small degree nodes and a few hubs, when two networks 
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FIG. 80 Schematic representation of two interdependent net- 
works. Nodes in network B (communications network) are 
dependent on nodes in network A (power grid) for power; 
nodes in network A are dependent on network B for control 
information. From [67]. 



are interdependent, hubs can be dependent on small de- 
gree nodes which leads to an increased vulnerability of 
the system. We note here that this problem is not equiv- 
alent to one dynamics on two connected networks, which 
obviously reduces to the usual case of one single network. 



F. Space and the spread of disease 

The importance of space and mobility networks ap- 
pears very clearly in the study of epidemic spread. Infec- 
tious diseases indeed spread because people interact and 
travel and the modeling of disease spread thus requires - 
ideally - the knowledge of the origin-destination matrix 
and of the social network. 

We first recall here the main results for a prototype of 
disease spreading (the SIR model) on lattices and small- 
world networks. In particular, we discuss the effect of 
shortcuts on the spreading process. We then discuss the 
metapopulation model which enters the description of the 
disease spread among several subpopulations connected 
by a network. Finally, we end this chapter by describing 
recent studies of a malware propagation among WIFI 
routers and the virus spread using Bluetooth and MMS. 



1. SIR on lattices and small-world networks 

In theoretical epidemiology, it is customary to divide 
the population into compartments, such as infectious (/), 
susceptible (S*), or removed individuals (i?), and where 
the number of compartments depends on the specific na- 
ture of the disease [17]. This description assumes that 
we neglect fiuctuations among individuals in the same 
state. Two main models have been studied in the litera- 
ture which are: 

• The SIS model. In this susceptible individ- 

ual in contact with infected individuals can become 
himself infected and will heal after a certain time 
and come back to a susceptible state, meaning that 
he can catch again the disease. This could for ex- 
ample describe the common cold which does not 
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confer long lasting immunity and for which indi- 
viduals become susceptible again after infection. 

• The SIR model. The difference here is that an in- 
fected individual becomes immunized (or has re- 
covered) after a certain time immunized and can- 
not catch the disease again. This is for example 
the case for infantile disease that we cannot catch 
a second time (in the vast majority of cases). 

In the homogeneous mixing approximation, the equa- 
tions governing the evolution of the number of suscepti- 
bles, infected, and recovered are in the SIR case 



^ +XS 
= +f-il 



N 



(280) 



along with the condition S+I+R = N and where A is the 
probability per unit time to catch the disease and fj, is the 
recovery rate (ie. 1/fi is the typical recovery time). The 
basic reproductive rate [17] is here given by Rq — A//i 
and represents the average number of secondary infec- 
tions when one infected individual is introduced in a sus- 
ceptible population. We thus have an epidemic threshold 
\c = jJL below which the disease is not contagious enough 
to spread in the population and above which an extensive 
fraction of the population gets infected. 

In the homogeneous approximation, any pair of indi- 
vidual can interact which is not realistic in many cases 
(but which usually allows for an analytical approach). 
We have then to introduce the contact network for which 
the nodes are the individuals and the presence of a link 
between i and j denotes the possibility that the virus can 
spread from i to j (or from j to i). When we introduce 
a contact network, the important question which imme- 
diately arises is the one of the epidemic threshold and 
the effect of the network structure on its value. In this 
section, we will examine two cases. We will first examine 
the SIR model in the case of a lattice and we then pro- 
ceed to the Watts-Strogatz case where long-range links 
are added to a lattice. 



a. SIR model in space. In the pre-industrial times, dis- 
ease spread was mainly a spatial diffusion phenomenon. 
For instance, during the spread of the so-called Black 
Death, which occurred in the 14*'' century (see for exam- 
ple [250]), only few travelling means were available and 
typical trips were limited to relatively short distances on 
the time scale of one day. Historical studies confirm that 
the propagation (see Fig. 81) indeed followed a simple 
scheme, with a spatio-temporal spread mainly dominated 
by spatial diffusion. 

The simplest way to describe the spatio-temporal evo- 
lution of a disease can be found for example in the book 
[241] who described a spatial SIR model where S{x, t) de- 
notes the susceptible individuals and I{x,t) the infected 
and infectious individuals at time t and at location x. 
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FIG. 81 Spread of the Black Death during the 14th century. 
The epidemic front spreads in Europe with a velocity of the 
order of 200400 miles per year. From [94]. 



In the non-spatial version the evolution of the densities 
s = S/N , i = I/N and r = R/N is described by the set 
of equations (see for example [17, 32]) 




^,1 



(281) 



along with the condition that the sum of the three densi- 
ties is one s + i + r = 1. In these equations, the number of 
individuals is absorbed in the definition of the quantity A. 
In the spatial version, the quantities s and i now depend 
on space and in the simplest version it is assumed that 
simple diffusion is the cause of their dispersion and that 
they have the same diffusion constant. The equations 
governing their evolution then become 



(282) 



When D and X = fi ^ 0, the infection front grows 
as \/t while the combination oi D and A ^ leads 
in fact to a finite velocity. A simple way to see this is 
given by the following argument for the one-dimensional 
case [241]. Using dimensional arguments, we will use the 
following rescaling 
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t ^ XSot (283) 



and where Ro = XSq /fJ-is the basic reproductive rate and 
must be larger than one in order to observe an outbreak 
[17] (the quantity 5*0 is the initial number of susceptibles) . 
We then obtain 



—si + 
+si — 



(284) 
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We look now for a traveling wave solution of Eq. (284) of 
the form 



i{x, t) = i{x — vt) 
s(x, t) — s{x — vt) 



(285) 



where v is the wavespeed which needs to be determined. 
Inserting this form in the system Eq. (284) we obtain 



i" + vi' + i{S - ^) =0 
= 



S + VS — IS 



(286) 



where the prime denotes the derivation with respect to 
the variable z = x — vt. We look for a solution such that 
i(— oo) = i{oo) = and < s(— oo) < s(oo) — 1. Near 
the wave front, s w 1 and i w and we obtain 



i'' + vi' + t{l - ^) « 

Hq 

which can be solved and leads to 



i{z) cx exp 



{-v± Jw2 -4(1 



))z/2 



(287) 
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This solution cannot oscillate around i — which implies 
that the wavespeed v and Rq must satisfy the conditions 
i?o > 1 and 



.>2(1-^)V2 



(289) 



For most initial conditions, the travelling wave computed 
in the full non-linear system will evolve at the minimal 
velocity and we eventually obtain the epidemic wave ve- 
locity 



1 

Ro 
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We can apply this simple model to the historical case 
of the Black Death. Following [241], the number of in- 
dividuals in Europe at that time is ~ 85 millions which 
gives a density 5*0 ~ 20 /km^. The quantities A and /i 
are more difficult to determine as the plague transmits 
with fleas jumping from rats to humans. In [250] these 
parameters are estimated to be /i ~ 15/year and A ~ 1.0 
km'^/year. The basic reproductive rate is then Rq « 1.33 
which is not very large. The main difficulty here is the es- 
timate of the diffusion coefficient and if we assume that 
news spread at a velocity of around 160 kms per year, 
we obtain D w 10** kms^/year. Putting all the numbers 
together we then find a velocity of the order v « 700 
kms/year which is larger than historical estimates put in 
the range [300, 600]kms/year, but given the uncertainties 
on the parameter (especially for D), the spatial diffusion 
approximation is not unreasonable. 



b. SIR model in small-worlds. An important observation 
which was made by Grassberger [145] is that the SIR 
model on a graph can be mapped to the correspond- 
ing bond percolation problem. This mapping is however 
valid only for at least a very peaked distribution of in- 
fection time [189] (the main reason for the failure of the 
mapping in the case of random infection times is the exis- 
tence of correlations introduced between neighbors of the 
same site). As in [145], the authors of [311, 312] consider 
a SIR model where nodes stay infectious during a time 
T and that the transmission rate between two connected 
nodes is a random variable distributed according to the 
distribution P{X). For a given value of A, the probability 
1 — p not to transmit the disease is given by 



1 - p (1 - XSt) 



r/St 



(291) 



which in the continuous time limit and averaged over all 
values of A gives 



p{t) = 1- J dXP{X)e- 



(292) 



This quantity p gives the fraction of bonds completed in 
a given epidemic and we thus obtain a mapping between 
the SIR model and a bond percolation problem. If p 
is larger than the percolation threshold on the underly- 
ing lattice, there is a giant cluster which means that an 
extensive number of nodes were infected. For a peaked 
distribution of A the epidemic threshold is given by 



Ac = -Tln(l -pc) 



(293) 



and indicates the general trend that if pc is smaller then 
Ac is smaller too. On a square lattice, for p > pc — 1/2 
we obtain results such as the one shown in Fig. 82(a) 
where we see some large regions unaffected by the disease. 
The authors of [311, 312] studied the SIR model on a 
two-dimensional Watts-Strogatz model with a fraction 
(j) of shortcuts connecting randomly chosen nodes. The 
result is shown in the Fig. 82b and displays evidence that 
even a small fraction of long-range links is enough to 
'homogenize' the system and to break down the large 
unaffected regions. In addition, as shown in [249] the 
percolation threshold decreases with (p (see Fig. 77) which 
implies that the epidemic threshold is also smaller due to 
the shortcuts which facilitate the spread of the disease. 



The same mapping between the SIR model and bond 
percolation implies that for the scale-free network with 
exponent 7 and constructed on a lattice (such as in [310]) 
the epidemic threshold is non-zero for 7 > 2, while for 
a non-spatial scale-free network, we expect Pc — (in 
the limit of infinite networks, see for example the book 
[255] ) . In this spatial lattice, it thus seems that the addi- 
tion of shortcuts and the existence of hubs is not enough 
to counterbalance the effect of the short links and the 
underlying spatial structure. 
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FIG. 82 Snapshot of the final cluster of recovered nodes (gray 
nodes) after the epidemics has died out on (a) a lattice with 
periodic boundary conditions, and (b) on a two-dimensional 
Watts-Strogatz network with a fraction of shortcuts </!> — 0.01. 
From [311]. 



2. From Euclidian space to networks 

In modern times, the simple picture of pm'e spatial 
diffusion, with the possible addition of few shortcuts, 
does however not hold anymore. A striking example can 
be seen in the SARS outbreak in 2003. In Fig. 83 we 
show the spatio-temporal evolution of this disease which 
started in China. We can see on this figure that there 




FIG. 83 Global evolution of the SARS disease which started 
in China in 2002. The color code corresponds to time and 
goes from the Hong Kong outbreak in November 2002 to the 
end of the disease spread in March 2003. Courtesy V. Colizza. 



is a clear spatial component with the disease spreading 
in the south-west Asia but also with long-range jumps to 
Europe and the US. This clearly shows that pure spatial 
diffusion is not a good model anymore and that the global 
aspect of transportation networks needs to be included 
in the modeling. At a smaller scale, it has also been 
observed that epizootics such as the foot-and-mouth dis- 
ease outbreak in 2001 in Great Britain's livestock didn't 
follow a purely spatial pattern [188], but instead, the 
epidemic showed local stochastic spread and rare long- 
distance jumps. 

An approach to the problem is provided by metapop- 
ulation models which describe the spread of an infec- 
tious disease at a scale where a transportation network 
dominates. These models describe spatially structured 
populations such as cities or urban areas which inter- 
act through a given transportation network and have 
been the subject of many studies (we refer the interested 
reader to [32] and references therein). In the simplest 
version of the metapopulation model, the nodes of the 
network are the cities (or urban areas) and these cities 
are connected through a network defined by a weighted 
adjacency matrix pij. The element pij represents the 
probability per unit time that an individual chosen at 
random in city i will travel to city j. In the case where 
we assume that any individual of a city has the same 
probability to travel (in other words if we neglect any so- 
cial structure), the pij can then be estimated as Wij/Ni 
where Wij is the flow of travelers between i and j and Ni 
is the population of city i. The evolution of the number 
li (t) of infected individuals in the city i at time t is then 
given by an equation of the form 

dMt)=K{S,,I,,R,,...)+n,{{I}) (294) 

where K represents the local term and corresponds to 
the spread of the disease due to intra-city contamination. 
The travel term J7 depends on the mobility network and 
in the simple case of uniform travel probability, can be 
written as (in the limit of large J/) 

^^P]ilj ~ Pijh (295) 
j 

If the probability is constant, f2 is essentially the Lapla- 
cian on the underlying network and if the mobility graph 
is a lattice, we recover the usual Laplacian diffusion term. 
From a theoretical perspective it would be interesting to 
understand precisely the crossovers when we have differ- 
ent scales and networks interacting with each other. 

Theoretical studies on the metapopulation model such 
as the existence of a pandemic threshold were done in 
[41, 89, 91, 95]. In particular, if we assume that the travel 
probability is constant pij = p, the pandemic threshold 
has an expression of the form 

cx (i?o - 1)^77/ (296) 
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The disease will then spread all over the world if i?* > 1 
which implies that i?o > 1 (ie- the disease spreads in 
each subpopulation) and that the product p(fc^) is large 
enough. For scale-free networks (such as for the world- 
wide airline network), the quantity (fc^) is large which 
suggests that reducing travel (ie. decreasing p) is not ef- 
ficient. The fact that reducing travel is not an efficient 
strategies was also confirmed in theoretical and numeri- 
cal studies [41, 89, 92, 95]. Other studies on applications 
of metapopulation models to the SARS or Influenza can 
be found in [25, 92, 93]. 

More recently, in [25], small-scale commuting flows 
were added to the large-scale airline network. This is 
an interesting problem since we have now the superim- 
position of networks at different scales. In the Fig. 84, we 
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FIG. 84 Comparison of prevalence profiles without and with 
commuting flows, (a-b-c) Prevalence profiles for three conti- 
nental US regions, (d-e) Prevalence profiles for cities in New 
England. From [25]. 

show the different prevalence (defined as the fraction of 
infected individuals in the population) profiles obtained 
with and without commuting fiows. At a regional level, 
the prevalence is increased when commuting fiows are 
added in the model but the more spectacular effect is at 
a local level: the inclusion of short-range mobility doesn't 
change the magnitude of prevalences but largely increases 
the synchrony between outbreaks in neighboring cities 
[25]. 



3. WIFI and Bluetooth epidemiology 

a. WIFI epidemiology. In densely populated areas, the 
range of the WIFI routers is such that the corresponding 
geometric graph can percolate through the whole urban 
area (see Fig. 85). This fact could be used to spread a 
computer virus or a malware that could have large-scale 
consequences. An epidemiological model was developed 
in [164] in order to study various scenarios and to assess 
the vulnerability of these networks. Using a database 
containing the geographic location of the wireless routers, 
Hu et al. [164] focused on seven urban areas in the US. 
From the set of vertices (which are here the routers), 
the geometric network can be constructed for a given 
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FIG. 85 (a) Construction of the WIFI network as a random 
geometric graph (see section IV.A.l). (b) Giant component 
of the WIFI network in Chicago for different values Rint- (c) 
Degree distribution for different values of Rint- From [164]. 



value of the router's range Rint- In practice, the value of 
Rint fluctuates from 10 to 100 meters depending on the 
environment and the specific router type. In this study, 
this value Rint is assumed to be the same for all routers 
(but actually the study of geometric random graphs with 
random Rint could be of interest). An example of such 
geometric network is shown in Fig. 85 for the Chicago 
area with Rint = 45 meters. 

Hu et al. then use the compartmental approach of 
standard epidemiology in order to describe the spread of 
a computer virus. The usual scheme must however be 
enlarged in order to take into account the heterogeneity 
observed in the security levels of WIFI routers. For de- 
tails of the epidemiological model, we refer the interested 
reader to the paper [164]. Once this epidemiological de- 
scription is set up, the authors studied the spread of the 
virus in various settings and on the giant component. 
An example of the spread of the virus is shown for the 
Chicago area in Fig. 86. This figure displays a clear spa- 
tial diffusion of the disease which takes place on the giant 
component of the WIFI network. The time scales are ac- 
tually very important in this case and Hu et al. estimated 
that in the Chicago case, the malware rapidly propagates 
in a few hours and infects about 40% two weeks after the 
beginning of the infection (and about 10% to 55% for 
most areas considered in this study). 

The fraction of encrypted routers and geometrical con- 
straints of the urban area may have large consequences 
on the spread of the malware. In particular, areas with 
small bottlenecks may be very sensitive to the local den- 
sity of encrypted routers. This study allows for the future 
to have an estimate of the immunization threshold in or- 
der to stop the spread of a malware. 
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FIG. 86 Spatial spread of a SIR type disease in the Chicago 
area (results obtained for Ri„t = 45m). From [164]. 

b. Mobile phone disease. In [308], Wang et al. study the 
spreading patterns of a mobile phone virus. A mobile 
phone virus can spread through two very different mech- 
anisms. The first mechanism is through Bluetooth con- 
nections provided that the infected phone and the sus- 
ceptible one are separated by a short distance (typically 
from 10 to 30 meters). The second mechanism, a MMS 
virus can send a copy of itself to all the numbers found 
in the infected's phone contact book. We thus have a 
short range, spatial infection pattern superimposed with 
long-range infections. However, a cell phone virus can 
infect only phones with the right operating system. The 
market share m of an OS is then an important parameter 
in the study of mobile phone viruses spread. The authors 
of [308] used a database of a mobile phone provider and 
simulated the spread of disease using a SI type compart- 
ment model and studied the effect of various values of 
m. They found the existence of a threshold value (of or- 
der mj, ss 10%) above which there is a giant component 
of phones using the same OS, allowing a MMS virus to 
spread (added to the fact that even for m > mc, the dy- 
namics still depends on m and the larger this parameter, 
the faster the dynamics). At the time of this study, the 
share of the largest OS was about 3% and could explain 
why we didn't see yet a major mobile virus outbreak. 



VI. SUMMARY AND OUTLOOK 

In this article we review the most important effects of 
space on the structure of networks. In particular, the ex- 
istence of a cost associated to the length of edges leads to 
many important consequences such as a large clustering 
and a flat assortativity. The degree distribution is usually 
peaked due to the existence of strong physical constraints 
although broad degree distributions can be observed for 



spatial, non-planar networks. The betweenness centrality 
displays strong fluctuations which can be related to the 
interplay of space and degree. These different effects are 
by now relatively well understood and as shown in the 
section IV, there is now a wealth of various models which 
incorporate space at different levels. Finally, in section 
V we discussed processes on spatial networks for various 
processes such as the random walk, diseases spread etc. 

Despite these various advances, there are still many 
open problems which could represent interesting research 
directions both at the theoretical and the applied levels. 
We give here a non-exhaustive, subjective list of such 
open problems which seems particularly interesting. 

• Scale and description. In problems such as dis- 
ease spread, the description depends strongly on 
the scale of interest. At a small spatial scale (but 
large enough in order to allow for the use of contin- 
uous limits), we can use partial differential equa- 
tions. In contrast, at a larger scale, typically of the 
order given by a typical link of the relevant trans- 
portation network, the system is better described 
by metapopulation models. Given the interest of 
interdependent scales and networks, an interesting 
question is how we could integrate a 'mixed' de- 
scription able to interpolate between a continuous 
partial differential equation to a discrete network, 
metapopulation-like description. Another way to 
look at this question is too look for a continuous, 
pde-like, description of the spread over a network 
at a large scale. 

• Interdependent networks. Recent studies showed 
the importance of connected networks and in par- 
ticular that coupled dynamics could lead to new 
and surprising results. In particular, we saw that 
networks are coupled via space and this can lead to 
cascade of failures more important than what could 
be expected. However, a number of questions re- 
mained unsolved at this point. For example what 
are the conditions for a coupling to be relevant and 
to affect the single-network behavior ? In particu- 
lar, in the development of failures, it would be in- 
teresting to understand the space-time properties 
of blackouts. 

• Optimal spatial networks. 

Optimal networks are very important as we saw in 
this review (see section IV. E). In particular, there 
are many different directions such as understand- 
ing the hub location problem in presence of conges- 
tion, and more generally we believe that statisti- 
cal physics could bring interesting insights on these 
problems usually tackled by mathematicians and 
engineers expert in optimization. Also, as recent 
studies suggested it, evolution through resilience 
and noise shaped loopy networks and it would be 
interesting to understand quantitatively the forma- 
tion and the statistics of these loops. 
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• Connection with socio-economical indicators. Data 
on spatial networks and in particular, road and 
other infrastructure networks are now available and 
these networks have been the subject of many stud- 
ies. Also, with the emergence of geosocial appli- 
cations on mobile phones for example we can ex- 
pect interesting studies connecting spatial distri- 
butions and social behavior. This line of research 
already appeared in recent studies which tried to 
relate topological structures of networks with socio- 
economical indicators. In these studies, an im- 
portant question concerns the correlations between 
topological quantities and social factors. For ex- 
ample, it would be interesting to know if we can 
understand some aspects of the spatial distribution 
of crime rates in terms of topological indicators of 
the road network (such as the betweenness central- 
ity for example). 

• Evolution of transportation (and spatial) networks. 
How transportation networks evolve is an old prob- 
lem and was already the subject of many studies in 
the 1970s (see for example [150]). However, apart 
from some exceptions (see for example [214] and 
references therein) this problem is still not very well 
understood. We are now in the position where data 
and tools are available and we can expect some in- 
teresting developments in this area. In parallel to 
empirical studies, we also need to develop theoret- 
ical ideas and models in order to describe the evo- 
lution of spatial networks. 

• Urban .studies. More generally, infrastructure and 
transportation networks are part of urban systems 
and we believe that the current understanding of 
spatial networks could help in understanding the 
structure and evolution of these systems. In partic- 
ular, our knowledge of spatial networks could help 
in the understanding of important phenomenon 
such as urban sprawl and in the design of sustain- 
able cities. 
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